(54) Methods for producing modified glycoproteins



(57) Cell lines having genetically modified glycosylation pathways that allow them to carry out a
sequence of enzymatic reactions, which mimic the processing of glycoproteins in humans, have been
developed. Recombinant proteins expressed in these engineered hosts yield glycoproteins more similar,
if not substantially identical, to their human counterparts. The lower eukaryotes, which ordinarily
produce high-mannose containing N-glycans, including unicellular and multicellular fungi, are modified
to produce N-glycans such as Man5GlcNAc2 or other structures along human glycosylation pathways. This
is achieved using a combination of engineering and/or selection of strains which: do not express
certain enzymes which create the undesirable complex structures characteristic of the fungal
glycoproteins, which express exogenous enzymes selected either to have optimal activity under the
conditions present in the fungi where activity is desired, or which are targeted to an organelle where
optimal activity is achieved, and combinations thereof wherein the genetically engineered eukaryote
expresses multiple exogenous enzymes required to produce "human-like" glycoproteins.

Description

Cross Reference To Related Applications 

[0001] Priority is claimed to U.S. Provisional Application Serial No. 60/214,358, filed on June 28, 2000,
U.S. Provisional Application Serial No. 60/215,638, filed on June 30, 2000, and U.S. Provisional
Application Serial No. 60/279,997, filed on March 30, 2001.

FIELD OF THE INVENTION 

[0002] The present invention is directed to methods and compositions by which fungi or other eukaryotic
microorganisms can be genetically modified to produce glycosylated proteins (glycoproteins) having
patterns of glycosylation similar to glycoproteins produced by animal cells, especially human cells,
which are useful as human or animal therapeutic agents.

BACKGROUND OF THE INVENTION 

Glycosylation Pathways 

[0003] De novo synthesized proteins may undergo further processing in cells, known as post-translational
modification. In particular, sugar residues may be added enzymatically, a process known as
glycosylation. The resulting proteins bearing covalently linked oligosaccharide side chains are known
as glycosylated proteins or glycoproteins. Bacteria typically do not glycosylate proteins; in cases
where glycosylation does occur it usually occurs at nonspecific sites in the protein (Moens and
Vanderleyden, Arch. Microbiol. 1997 168(3):169-175).

[0004] Eukaryotes commonly attach a specific oligosaccharide to the side chain of a protein asparagine
residue, particularly an asparagine which occurs in the sequence Asn-Xaa-Ser/Thr/Cys (where Xaa
represents any amino acid). Following attachment of the saccharide moiety, known as an N-glycan,
further modifications may occur in vivo. Typically these modifications occur via an ordered sequence
of enzymatic reactions, known as a cascade. Different organisms provide different glycosylation
enzymes (glycosyltransferases and glycosidases) and different glycosyl substrates, so that the final
composition of a sugar side chain may vary markedly depending upon the host.
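
By way of illustration only, the sequence motif described above can be located in a protein sequence
with a short script. The following Python sketch is not part of the disclosed methods; the function
name and the example sequence are hypothetical, and the pattern follows the motif exactly as stated
in the text (Xaa taken as any residue).

```python
import re

# Motif as stated in the text: Asn-Xaa-Ser/Thr/Cys, with Xaa taken as any residue.
# The lookahead lets overlapping motifs (e.g. "NNCS") be reported individually.
SEQUON = re.compile(r"N(?=[A-Z][STC])")


def find_sequons(protein: str) -> list[int]:
    """Return 1-based positions of Asn residues that begin an N-X-S/T/C motif."""
    return [match.start() + 1 for match in SEQUON.finditer(protein.upper())]


# Hypothetical one-letter sequence, not taken from any real protein.
example = "MKNASLLNGTQPNNCSW"
print(find_sequons(example))
```

A match indicates only a candidate site; whether a given asparagine is actually glycosylated depends
on the host and on the protein, as the paragraph above notes.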

[0005] For example, microorganisms such as filamentous fungi and yeast (lower eukaryotes) typically add
additional mannose and/or mannosylphosphate sugars. The resulting glycan is known as a "high-mannose"
type or a mannan. By contrast, in animal cells, the nascent oligosaccharide side chain may be trimmed
to remove several mannose residues and elongated with additional sugar residues that typically do not
occur in the N-glycans of lower eukaryotes. See R.K. Bretthauer, et al. Biotechnology and Applied
Biochemistry, 1999, 30, 193-200; W. Martinet, et al. Biotechnology Letters, 1998, 20, 1171-1177;
S. Weikert, et al. Nature Biotechnology, 1999, 17, 1116-1121; M. Malissard, et al. Biochemical and
Biophysical Research Communications, 2000, 267, 169-173; Jarvis, et al. 1998 Engineering
N-glycosylation pathways in the baculovirus-insect cell system, Current Opinion in Biotechnology,
9:528-533; and M. Takeuchi, 1997 Trends in Glycoscience and Glycotechnology, 1997, 9, S29-S35.

[0006] The N-glycans that are produced in humans and animals are commonly referred to as complex
N-glycans. A complex N-glycan means a structure with typically two to six outer branches with a
sialyllactosamine sequence linked to an inner core structure Man3GlcNAc2. A complex N-glycan has at
least one branch, and preferably at least two, of alternating GlcNAc and galactose (Gal) residues that
terminate in oligosaccharides such as, for example: NeuNAc-; NeuAcα2-6GalNAcα1-;
NeuAcα2-3Galβ1-3GalNAcα1-; NeuAcα2-3/6Galβ1-4GlcNAcβ1-; GlcNAcα1-4Galβ1- (mucins only);
Fucα1-2Galβ1- (blood group H). Sulfate esters can occur on galactose, GalNAc, and GlcNAc residues,
and phosphate esters can occur on mannose residues. NeuAc (Neu: neuraminic acid; Ac: acetyl) can be
O-acetylated or replaced by NeuGl (N-glycolylneuraminic acid). Complex N-glycans may also have
intrachain substitutions of bisecting GlcNAc and core fucose (Fuc).

[0007] Human glycosylation begins with a sequential set of reactions in the endoplasmic reticulum (ER)
leading to a core oligosaccharide structure, which is transferred onto de novo synthesized proteins at
the asparagine residue in the sequence Asn-Xaa-Ser/Thr (see Figure 1A). Further processing by
glucosidases and mannosidases occurs in the ER before the nascent glycoprotein is transferred to the
early Golgi apparatus, where additional mannose residues are removed by Golgi-specific
1,2-mannosidases. Processing continues as the protein proceeds through the Golgi. In the medial Golgi
a number of modifying enzymes, including N-acetylglucosamine transferases (GnT I, GnT II, GnT III,
GnT IV, GnT V, GnT VI), mannosidase II and fucosyltransferases, add and remove specific sugar residues
(see Figure 1B). Finally, in the trans Golgi, the N-glycans are acted on by galactosyltransferases and
sialyltransferases (ST) and the finished glycoprotein is released from the Golgi apparatus. The
protein N-glycans of animal glycoproteins have bi-, tri-, or tetra-antennary structures, and may
typically include galactose, fucose, and N-acetylglucosamine. Commonly the terminal residues of the
N-glycans consist of sialic acid. A typical structure of a human N-glycan is shown in Figure 1B.

Sugar Nucleotide Precursors

[0008] The N-glycans of animal glycoproteins typically include galactose, fucose, and terminal sialic
acid. These sugars are not generally found on glycoproteins produced in yeast and filamentous fungi.
In humans, the full range of nucleotide sugar precursors (e.g. UDP-N-acetylglucosamine,
UDP-N-acetylgalactosamine, CMP-N-acetylneuraminic acid, UDP-galactose, GDP-fucose etc.) are generally
synthesized in the cytosol and transported into the Golgi, where they are attached to the core
oligosaccharide by glycosyltransferases (Sommers and Hirschberg, 1981 J. Cell Biol. 91(2):A406-A406;
Sommers and Hirschberg 1982 J. Biol. Chem. 257(18):811-817; Perez and Hirschberg 1987 Methods in
Enzymology 138:709-715).

[0009] Glycosyl transfer reactions typically yield a side product which is a nucleoside diphosphate or
monophosphate. While monophosphates can be directly exported in exchange for nucleoside triphosphate
sugars by an antiport mechanism, diphosphonucleosides (e.g. GDP) have to be cleaved by phosphatases
(e.g. GDPase) to yield nucleoside monophosphates and inorganic phosphate prior to being exported. This
reaction is important for efficient glycosylation; for example, GDPase from S. cerevisiae has been
found to be necessary for mannosylation. However, the GDPase has 90% reduced activity toward UDP
(Berninsone et al., 1994 J. Biol. Chem. 269(1):207-211). Lower eukaryotes typically lack UDP-specific
diphosphatase activity in the Golgi since they do not utilize UDP-sugar precursors for Golgi-based
glycoprotein synthesis. Schizosaccharomyces pombe, a yeast found to add galactose residues to cell
wall polysaccharides (from UDP-galactose), has been found to have specific UDPase activity, indicating
the requirement for such an enzyme (Berninsone et al., 1994). UDP is known to be a potent inhibitor of
glycosyltransferases, and the removal of this glycosylation side product is important in order to
prevent glycosyltransferase inhibition in the lumen of the Golgi (Khatara et al., 1974). See
Berninsone, P., et al. 1995 J. Biol. Chem. 270(24):14564-14567; Beaudet, L., et al. 1998 ABC
Transporters: Biochemical, Cellular, and Molecular Aspects, 292:397-413.

Compartmentalization of Glycosylation Enzymes 

[0010] Glycosyltransferases and mannosidases line the inner (luminal) surface of the ER and Golgi
apparatus and thereby provide a catalytic surface that allows for the sequential processing of
glycoproteins as they proceed through the ER and Golgi network. The multiple compartments of the cis,
medial, and trans Golgi and the trans Golgi Network (TGN) provide the different localities in which
the ordered sequence of glycosylation reactions can take place. As a glycoprotein proceeds from
synthesis in the ER to full maturation in the late Golgi or TGN, it is sequentially exposed to
different glycosidases, mannosidases and glycosyltransferases such that a specific N-glycan structure
may be synthesized. The enzymes typically include a catalytic domain, a stem region, a membrane
spanning region and an N-terminal cytoplasmic tail. The latter three structural components are
responsible for directing a glycosylation enzyme to the appropriate locus.

[0011] Localization sequences from one organism may function in other organisms. For example, the
membrane spanning region of α-2,6-sialyltransferase (α-2,6-ST) from rats, an enzyme known to localize
in the rat trans Golgi, was shown to also localize a reporter gene (invertase) in the yeast Golgi
(Schwientek, et al., 1995). However, the very same membrane spanning region as part of a full-length
α-2,6-sialyltransferase was retained in the ER and not further transported to the Golgi of yeast
(Krezdorn et al., 1994). A full-length GalT from humans was not even synthesized in yeast, despite
demonstrably high transcription levels. On the other hand, the transmembrane region of the same human
GalT fused to an invertase reporter was able to direct localization to the yeast Golgi, albeit at low
production levels. Schwientek and co-workers have shown that fusing 28 amino acids of a yeast
mannosyltransferase (Mnt1), a region containing an N-terminal cytoplasmic tail, a transmembrane region
and eight amino acids of the stem region, to the catalytic domain of human GalT is sufficient for
Golgi localization of an active GalT (Schwientek et al. 1995 J. Biol. Chem. 270(10):5483-5489). Other
galactosyltransferases appear to rely on interactions with enzymes resident in particular organelles,
since after removal of their transmembrane region they are still able to localize properly.

[0012] Improper localization of a glycosylation enzyme may prevent proper functioning of the enzyme in
the pathway. For example, Aspergillus nidulans, which has numerous α-1,2-mannosidases (Eades and
Hintz, 2000 Gene 255(1):25-34), does not add GlcNAc to Man5GlcNAc2 when transformed with the rabbit
GnT I gene, despite a high overall level of GnT I activity (Kalsner et al., 1995). GnT I, although
actively expressed, may be incorrectly localized such that the enzyme is not in contact with both of
its substrates: the nascent N-glycan of the glycoprotein and UDP-GlcNAc. Alternatively, the host
organism may not provide an adequate level of UDP-GlcNAc in the Golgi.

Glycoproteins Used Therapeutically 



[0013] A significant fraction of proteins isolated from humans or other animals are glycosylated. Among
proteins used therapeutically, about 70% are glycosylated. If a therapeutic protein is produced in a
microorganism host such as yeast, however, and is glycosylated utilizing the endogenous pathway, its
therapeutic efficiency is typically greatly reduced. Such glycoproteins are typically immunogenic in
humans and show a reduced half-life in vivo after administration (Takeuchi, 1997).

[0014] Specific receptors in humans and animals can recognize terminal mannose residues and promote the
rapid clearance of the protein from the bloodstream. Additional adverse effects may include changes in
protein folding, solubility, susceptibility to proteases, trafficking, transport,
compartmentalization, secretion, recognition by other proteins or factors, antigenicity, or
allergenicity. Accordingly, it has been necessary to produce therapeutic glycoproteins in animal host
systems, so that the pattern of glycosylation is identical or at least similar to that in humans or in
the intended recipient species. In most cases a mammalian host system, such as mammalian cell culture,
is used.

Systems for Producing Therapeutic Glycoproteins 

[0015] In order to produce therapeutic proteins that have appropriate glycoforms and have satisfactory
therapeutic effects, animal or plant-based expression systems have been used. The available systems
include:

1. Chinese hamster ovary cells (CHO), mouse fibroblast cells and mouse myeloma cells
(Arzneimittelforschung. 1998 Aug;48(8):870-880);

2. transgenic animals such as goats, sheep, mice and others (Dente 1989 Prog. Clin. Biol. Res.
300:85-98; Ruther et al., 1988 Cell 53(6):847-856; Ware, J., et al. 1993 Thrombosis and Haemostasis
69(6):1194-1194; Cole, E. S., et al. 1994 J. Cell. Biochem. 265-265);

3. plants (Arabidopsis thaliana, tobacco etc.) (Staub, et al. 2000 Nature Biotechnology 18(3):333-338;
McGarvey, P. B., et al. 1995 Bio-Technology 13(13):1484-1487; Bardor, M., et al. 1999 Trends in Plant
Science 4(9):376-380);

4. insect cells (Spodoptera frugiperda Sf9, Sf21, Trichoplusia ni, etc. in combination with
recombinant baculoviruses such as Autographa californica multiple nuclear polyhedrosis virus, which
infects lepidopteran cells) (Altmann et al., 1999 Glycoconj. J. 16(2):109-123).

[0016] Recombinant human proteins expressed in the above-mentioned host systems may still include
non-human glycoforms (Raju et al., 2000 Annals Biochem. 283(2):123-132). In particular, a fraction of
the N-glycans may lack terminal sialic acid, typically found in human glycoproteins. Substantial
efforts have been directed to developing processes to obtain glycoproteins that are as close as
possible in structure to the human forms, or have other therapeutic advantages. Glycoproteins having
specific glycoforms may be especially useful, for example in the targeting of therapeutic proteins.
For example, the addition of one or more sialic acid residues to a glycan side chain may increase the
lifetime of a therapeutic glycoprotein in vivo after administration. Accordingly, the mammalian host
cells may be genetically engineered to increase the extent of terminal sialic acid in glycoproteins
expressed in the cells. Alternatively, sialic acid may be conjugated to the protein of interest in
vitro prior to administration using a sialic acid transferase and an appropriate substrate. In
addition, changes in growth medium composition or the expression of enzymes involved in human
glycosylation have been employed to produce glycoproteins more closely resembling the human forms
(S. Weikert, et al., Nature Biotechnology, 1999, 17, 1116-1121; Werner, Noe, et al. 1998
Arzneimittelforschung 48(8):870-880; Weikert, Papac et al., 1999; Andersen and Goochee 1994 Cur. Opin.
Biotechnol. 5:546-549; Yang and Butler 2000 Biotechnol. Bioengin. 68(4):370-380). Alternatively,
cultured human cells may be used.

[0017] However, all of the existing systems have significant drawbacks. Only certain therapeutic
proteins are suitable for expression in animal or plant systems (e.g. those lacking in any cytotoxic
effect or other effect adverse to growth). Animal and plant cell culture systems are usually very
slow, frequently requiring over a week of growth under carefully controlled conditions to produce any
useful quantity of the protein of interest. Protein yields nonetheless compare unfavorably with those
from microbial fermentation processes. In addition, cell culture systems typically require complex and
expensive nutrients and cofactors, such as bovine fetal serum. Furthermore, growth may be limited by
programmed cell death (apoptosis).

[0018] Moreover, animal cells (particularly mammalian cells) are highly susceptible to viral infection
or contamination. In some cases the virus or other infectious agent may compromise the growth of the
culture, while in other cases the agent may be a human pathogen rendering the therapeutic protein
product unfit for its intended use. Furthermore, many cell culture processes require the use of
complex, temperature-sensitive, animal-derived growth media components, which may carry pathogens such
as bovine spongiform encephalopathy (BSE) prions. Such pathogens are difficult to detect and/or
difficult to remove or sterilize without compromising the growth medium. In any case, use of animal
cells to produce therapeutic proteins necessitates costly quality controls to assure product safety.

[0019] Transgenic animals may also be used for manufacturing high-volume therapeutic proteins such as
human serum albumin, tissue plasminogen activator, monoclonal antibodies, hemoglobin, collagen,
fibrinogen and others. While transgenic goats and other transgenic animals (mice, sheep, cows, etc.)
can be genetically engineered to produce therapeutic proteins at high concentrations in the milk, the
process is costly since every batch has to undergo rigorous quality control. Animals may host a
variety of animal or human pathogens, including bacteria, viruses, fungi, and prions. In the case of
scrapie and bovine spongiform encephalopathy, testing can take about a year to rule out infection. The
production of therapeutic compounds is thus preferably carried out in a well-controlled sterile
environment, e.g. under Good Manufacturing Practice (GMP) conditions. However, it is not generally
feasible to maintain animals in such environments. Moreover, whereas cells grown in a fermenter are
derived from one well characterized Master Cell Bank (MCB), transgenic animal technology relies on
different animals and thus is inherently non-uniform. Furthermore, external factors such as different
food uptake, disease and lack of homogeneity within a herd may affect glycosylation patterns of the
final product. It is known in humans, for example, that different dietary habits result in differing
glycosylation patterns.

[0020] Transgenic plants have been developed as a potential source to obtain proteins of therapeutic
value. However, high level expression of proteins in plants suffers from gene silencing, a mechanism
by which the genes for highly expressed proteins are down-regulated in subsequent plant generations.
In addition, plants add xylose and/or α-1,3-linked fucose to protein N-glycans, resulting in
glycoproteins that differ in structure from those of animals and are immunogenic in mammals (Altmann,
Marz et al., 1995 Glycoconj. J. 12(2):150-155). Furthermore, it is generally not practical to grow
plants in a sterile or GMP environment, and the recovery of proteins from plant tissues is more costly
than the recovery from fermented microorganisms.

Glycoprotein Production Using Eukaryotic Microorganisms 

[0021] The lack of a suitable expression system is thus a significant obstacle to the low-cost and safe
production of recombinant human glycoproteins. Production of glycoproteins via the fermentation of
microorganisms would offer numerous advantages over the existing systems. For example,
fermentation-based processes may offer (a) rapid production of high concentrations of protein; (b) the
ability to use sterile, well-controlled production conditions (e.g. GMP conditions); (c) the ability
to use simple, chemically defined growth media; (d) ease of genetic manipulation; (e) the absence of
contaminating human or animal pathogens; (f) the ability to express a wide variety of proteins,
including those poorly expressed in cell culture owing to toxicity etc.; (g) ease of protein recovery
(e.g. via secretion into the medium). In addition, fermentation facilities are generally far less
costly to construct than cell culture facilities.

[0022] As noted above, however, bacteria, including species such as Escherichia coli commonly used to
produce recombinant proteins, do not glycosylate proteins in a specific manner like eukaryotes.
Various methylotrophic yeasts such as Pichia pastoris, Pichia methanolica, and Hansenula polymorpha
are particularly useful as eukaryotic expression systems, since they are able to grow to high cell
densities and/or secrete large quantities of recombinant protein. However, as noted above,
glycoproteins expressed in these eukaryotic microorganisms differ substantially in N-glycan structure
from those in animals. This has prevented the use of yeast or filamentous fungi as hosts for the
production of many useful glycoproteins.

[0023] Several efforts have been made to modify the glycosylation pathways of eukaryotic microorganisms
to provide glycoproteins more suitable for use as mammalian therapeutic agents. For example, several
glycosyltransferases have been separately cloned and expressed in S. cerevisiae (GalT, GnT I),
Aspergillus nidulans (GnT I) and other fungi (Yoshida et al., 1999; Kalsner et al., 1995 Glycoconj. J.
12(3):360-370; Schwientek et al., 1995). However, N-glycans with human characteristics were not
obtained.

[0024] Yeasts produce a variety of mannosyltransferases, e.g. 1,3-mannosyltransferases (e.g. MNN1 in
S. cerevisiae) (Graham and Emr, 1991 J. Cell. Biol. 114(2):207-218), 1,2-mannosyltransferases (e.g.
KTR/KRE family from S. cerevisiae), 1,6-mannosyltransferases (OCH1 from S. cerevisiae),
mannosylphosphate transferases (MNN4 and MNN6 from S. cerevisiae) and additional enzymes that are
involved in endogenous glycosylation reactions. Many of these genes have been deleted individually,
giving rise to viable organisms having altered glycosylation profiles. Examples are shown in Table 1.



Table 1. Examples of yeast strains having altered mannosylation

Strain          N-glycan (wild type)   Mutation          N-glycan (mutant)   Reference
S. pombe        Man>9GlcNAc2           OCH1              Man8GlcNAc2         Yoko-o et al., 2001 FEBS Lett. 489(1):75-80
S. cerevisiae   Man>9GlcNAc2           OCH1/MNN1         Man8GlcNAc2         Nakanishi-Shindo et al., 1993 J. Biol. Chem. 268(35):26338-26345
S. cerevisiae   Man>9GlcNAc2           OCH1/MNN1/MNN4    Man8GlcNAc2         Chiba et al., 1998 J. Biol. Chem. 273, 26298-26304


[0025] In addition, Japanese Patent Application Public No. 8-336387 discloses an OCH1 mutant strain of
Pichia pastoris. The OCH1 gene encodes 1,6-mannosyltransferase, which adds a mannose to the glycan
structure Man8GlcNAc2 to yield Man9GlcNAc2. The Man9GlcNAc2 structure is then a substrate for further
mannosylation in vivo, leading to the hypermannosylated glycoproteins that are characteristic of
yeasts and typically may have at least 30-40 mannose residues per N-glycan. In the OCH1 mutant strain,
proteins glycosylated with Man8GlcNAc2 are accumulated and hypermannosylation does not occur. However,
the structure Man8GlcNAc2 is not a substrate for animal glycosylation enzymes, such as human
UDP-GlcNAc transferase I, and accordingly the method is not useful for producing proteins with human
glycosylation patterns.

[0026] Martinet et al. (Biotechnol. Lett. 1998, 20(12), 1171-1177) reported the expression of
α-1,2-mannosidase from Trichoderma reesei in P. pastoris. Some mannose trimming from the N-glycans of
a model protein was observed. However, the model protein had no N-glycans with the structure
Man5GlcNAc2, which would be necessary as an intermediate for the generation of complex N-glycans.
Accordingly the method is not useful for producing proteins with human or animal glycosylation
patterns.

[0027] Similarly, Chiba et al. 1998 expressed α-1,2-mannosidase from Aspergillus saitoi in the yeast
Saccharomyces cerevisiae. A signal peptide sequence (His-Asp-Glu-Leu) was engineered into the
exogenous mannosidase to promote retention in the endoplasmic reticulum. In addition, the yeast host
was a mutant lacking three enzyme activities associated with hypermannosylation of proteins:
1,6-mannosyltransferase (OCH1); 1,3-mannosyltransferase (MNN1); and mannosylphosphatetransferase
(MNN4). The N-glycans of the triple mutant host thus consisted of the structure Man8GlcNAc2, rather
than the high mannose forms found in wild-type S. cerevisiae. In the presence of the engineered
mannosidase, the N-glycans of a model protein (carboxypeptidase Y) were trimmed to give a mixture
consisting of 27 mole % Man5GlcNAc2, 22 mole % Man6GlcNAc2, 22 mole % Man7GlcNAc2, and 29 mole %
Man8GlcNAc2. Trimming of the endogenous cell wall glycoproteins was less efficient, only 10 mole % of
the N-glycans having the desired Man5GlcNAc2 structure.

[0028] Since only the Man5GlcNAc2 glycans would be susceptible to further enzymatic conversion to human
glycoforms, the method is not efficient for the production of proteins having human glycosylation
patterns. In proteins having a single N-glycosylation site, at least 73 mole % would have an
incorrect structure. In proteins having two or three glycosylation sites, respectively at least 93 or
98 mole % would have an incorrect structure. Such low efficiencies of conversion are unsatisfactory
for the production of therapeutic agents, particularly as the separation of proteins having different
glycoforms is typically costly and difficult.
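
The percentages in the preceding paragraph can be reproduced with a short calculation. The following
Python sketch is illustrative only and assumes, as the quoted figures appear to, that each
glycosylation site independently carries the desired structure with probability 0.27; the function
name is hypothetical. Under that assumption the fraction of protein molecules with at least one
incorrect site is 1 - 0.27^n for n sites.

```python
def fraction_incorrect(p_correct: float, n_sites: int) -> float:
    """Fraction of molecules with at least one site lacking the desired glycan.

    Assumes every site is glycosylated independently with the same probability
    p_correct of carrying the desired structure (an illustrative assumption).
    """
    return 1.0 - p_correct ** n_sites


# 27 mole % of the N-glycans carry the desired Man5GlcNAc2 structure.
for sites in (1, 2, 3):
    percent = 100 * fraction_incorrect(0.27, sites)
    print(f"{sites} site(s): {percent:.0f} mole % incorrect")
```

For one, two and three sites this gives 73, 93 and 98 mole % respectively, matching the figures
stated above.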

[0029] With the object of providing a more human-like glycoprotein derived from a fungal host, U.S.
Patent No. 5,834,251 to Maras and Contreras discloses a method for producing a hybrid glycoprotein
derived from Trichoderma reesei. A hybrid N-glycan has only mannose residues on the Manα1-6 arm of the
core and one or two complex antennae on the Manα1-3 arm. While this structure has utility, the method
has the disadvantage that numerous enzymatic steps must be performed in vitro, which is costly and
time-consuming. Isolated enzymes are expensive to prepare and maintain, may need unusual and costly
substrates (e.g. UDP-GlcNAc), and are prone to loss of activity and/or proteolysis under the
conditions of use.

[0030] It is therefore an object of the present invention to provide a system and methods for
humanizing glycosylation of recombinant glycoproteins expressed in Pichia pastoris and other lower
eukaryotes such as Hansenula polymorpha, Pichia stipitis, Pichia methanolica, Pichia sp.,
Kluyveromyces sp., Candida albicans, Aspergillus nidulans, and Trichoderma reesei.

SUMMARY OF THE INVENTION



[0031] Cell lines having genetically modified glycosylation pathways that allow them to carry out a
sequence of enzymatic reactions, which mimic the processing of glycoproteins in humans, have been
developed. Recombinant proteins expressed in these engineered hosts yield glycoproteins more similar,
if not substantially identical, to their human counterparts. The lower eukaryotes, which ordinarily
produce high-mannose containing N-glycans, including unicellular and multicellular fungi such as
Pichia pastoris, Hansenula polymorpha, Pichia stipitis, Pichia methanolica, Pichia sp., Kluyveromyces
sp., Candida albicans, Aspergillus nidulans, and Trichoderma reesei, are modified to produce N-glycans
such as Man5GlcNAc2 or other structures along human glycosylation pathways. This is achieved using a
combination of engineering and/or selection of strains which: do not express certain enzymes which
create the undesirable complex structures characteristic of the fungal glycoproteins, which express
exogenous enzymes selected either to have optimal activity under the conditions present in the fungi
where activity is desired, or which are targeted to an organelle where optimal activity is achieved,
and combinations thereof wherein the genetically engineered eukaryote expresses multiple exogenous
enzymes required to produce "human-like" glycoproteins.

[0032] In a first embodiment, the microorganism is engineered to express an exogenous α-1,2-mannosidase
enzyme having an optimal pH between 5.1 and 8.0, preferably between 5.9 and 7.5. In an alternative
preferred embodiment, the exogenous enzyme is targeted to the endoplasmic reticulum or Golgi apparatus
of the host organism, where it trims N-glycans such as Man8GlcNAc2 to yield Man5GlcNAc2. The latter
structure is useful because it is identical to a structure formed in mammals, especially humans; it is
a substrate for further glycosylation reactions in vivo and/or in vitro that produce a finished
N-glycan that is similar or identical to that formed in mammals, especially humans; and it is not a
substrate for hypermannosylation reactions that occur in vivo in yeast and other microorganisms and
that render a glycoprotein highly immunogenic in animals.
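
As an illustration of the pH criterion of the first embodiment, candidate enzymes could be sorted by
their measured pH optimum. The following Python sketch is not part of the disclosed methods: the
candidate names and pH values are hypothetical placeholders, and treating the range endpoints as
inclusive is an assumption, since the text says only "between".

```python
def classify_ph_optimum(ph_optimum: float) -> str:
    """Classify an enzyme by its pH optimum against the ranges in the text.

    Endpoints are treated as inclusive, which is an assumption.
    """
    if 5.9 <= ph_optimum <= 7.5:
        return "preferred"   # preferred range: 5.9 to 7.5
    if 5.1 <= ph_optimum <= 8.0:
        return "acceptable"  # broader range: 5.1 to 8.0
    return "outside"


# Hypothetical candidates and values, for illustration only.
candidates = {"candidate_A": 4.5, "candidate_B": 5.5, "candidate_C": 6.8}
for name, ph in candidates.items():
    print(name, classify_ph_optimum(ph))
```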

[0033] In a second embodiment, the glycosylation pathway of a eukaryotic microorganism is modified by
(a) constructing a DNA library including at least two genes encoding exogenous glycosylation enzymes;
(b) transforming the microorganism with the library to produce a genetically mixed population
expressing at least two distinct exogenous glycosylation enzymes; (c) selecting from the population a
microorganism having the desired glycosylation phenotype. In a preferred embodiment, the DNA library
includes chimeric genes each encoding a protein localization sequence and a catalytic activity related
to glycosylation. Organisms modified using the method are useful for producing glycoproteins having a
glycosylation pattern similar or identical to mammals, especially humans.
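
The combinatorial character of such a library of chimeric genes can be sketched in a few lines. The
following Python example is illustrative only: the localization-sequence and catalytic-domain names
are hypothetical placeholders, and the `assay` callable stands in for whatever laboratory screen is
used in step (c); the transformation of step (b) has no software counterpart and is omitted.

```python
from itertools import product
from typing import Callable

Construct = tuple[str, str]  # (localization sequence, catalytic domain)

# Hypothetical placeholder names; no specific library members are implied.
LOCALIZATION_SEQUENCES = ["loc_ER_1", "loc_Golgi_1", "loc_Golgi_2"]
CATALYTIC_DOMAINS = ["cat_mannosidase_1", "cat_mannosidase_2"]


def build_library(localizations: list[str], catalytic_domains: list[str]) -> list[Construct]:
    """Step (a): enumerate every localization-sequence / catalytic-domain fusion."""
    return list(product(localizations, catalytic_domains))


def select_constructs(
    library: list[Construct],
    assay: Callable[[Construct], float],
    threshold: float,
) -> list[Construct]:
    """Step (c): keep constructs whose assay readout meets the threshold."""
    return [construct for construct in library if assay(construct) >= threshold]


library = build_library(LOCALIZATION_SEQUENCES, CATALYTIC_DOMAINS)
print(len(library))  # 3 localization sequences x 2 catalytic domains
```

The library size grows as the product of the two lists, which is why selection from a mixed
population, rather than one-by-one construction, is attractive.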

[0034] In a third embodiment, the glycosylation pathway is modified to express a sugar nucleotide
transporter enzyme. In a preferred embodiment, a nucleotide diphosphatase enzyme is also expressed.
The transporter and diphosphatase improve the efficiency of engineered glycosylation steps, by
providing the appropriate substrates for the glycosylation enzymes in the appropriate compartments,
reducing competitive product inhibition, and promoting the removal of nucleoside diphosphates.

DESCRIPTION OF THE FIGURES 

[0035] 

Figure 1A is a schematic diagram of a typical fungal N-glycosylation pathway.
Figure 1B is a schematic diagram of a typical human N-glycosylation pathway.

DETAILED DESCRIPTION OF THE INVENTION 

[0036] The methods and recombinant lower eukaryotic strains described herein are used to make
"humanized glycoproteins". The recombinant lower eukaryotes are made by engineering lower eukaryotes
which do not express one or more enzymes involved in production of high mannose structures to express
the enzymes required to produce human-like sugars. As used herein, a lower eukaryote is a unicellular
or filamentous fungus. As used herein, a "humanized glycoprotein" refers to a protein having attached
thereto N-glycans including fewer than four mannose residues, and the synthetic intermediates (which
are also useful and can be manipulated further in vitro) having at least five mannose residues. In a
preferred embodiment, the glycoproteins produced in the recombinant lower eukaryotic strains contain
at least 27 mole % of the Man5 intermediate. This is achieved by cloning in a better mannosidase,
i.e., an enzyme selected to have optimal activity under the conditions present in the organisms at the
site where proteins are glycosylated, or by targeting the enzyme to the organelle where activity is
desired.

[0037] In a preferred embodiment, eukaryotic strains which do not express one or more enzymes involved in the
production of high mannose structures are used. These strains can be engineered, or can be one of the many such mutants
already described in yeasts, including a hypermannosylation-minus (OCH1) mutant in Pichia pastoris.
[0038] The strains can be engineered one enzyme at a time, or a library of genes encoding potentially useful enzymes
can be created, and those strains having enzymes with optimal activities or producing the most "human-like"
glycoproteins can be selected.

[0039] Lower eukaryotes that are able to produce glycoproteins having the attached N-glycan Man5GlcNAc2 are
particularly useful since (a) lacking a high degree of mannosylation (e.g. greater than 8 mannoses per N-glycan, or
especially 30-40 mannoses), they show reduced immunogenicity in humans; and (b) the N-glycan is a substrate for
further glycosylation reactions to form an even more human-like glycoform, e.g. by the action of GlcNAc transferase I
to form GlcNAcMan5GlcNAc2. Man5GlcNAc2 must be formed in vivo in a high yield, at least transiently, since all
subsequent glycosylation reactions require Man5GlcNAc2 or a derivative thereof. Accordingly, a yield is obtained of greater
than 27 mole %, more preferably a yield of 50-100 mole %, of glycoproteins in which a high proportion of N-glycans have
Man5GlcNAc2. It is then possible to perform further glycosylation reactions in vitro, using for example the method of
U.S. Patent No. 5,834,251 to Maras and Contreras. In a preferred embodiment, at least one further glycosylation
reaction is performed in vivo. In a highly preferred embodiment thereof, active forms of glycosylating enzymes are
expressed in the endoplasmic reticulum and/or Golgi apparatus.


Host Microorganisms 

[0040] Yeast and filamentous fungi have both been successfully used for the production of recombinant proteins,
both intracellular and secreted (Cereghino, J. L. and J. M. Cregg 2000 FEMS Microbiology Reviews 24(1): 45-66;
Harkki, A., et al. 1989 Bio-Technology 7(6): 596; Berka, R. M., et al. 1992 Abstr. Papers Amer. Chem. Soc. 203: 121-BIOT;
Svetina, M., et al. 2000 J. Biotechnol. 76(2-3): 245-251).

[0041] Although glycosylation in yeast and fungi is very different than in humans, some common elements are shared.
The first step, the transfer of the core oligosaccharide structure to the nascent protein, is highly conserved in all
eukaryotes including yeast, fungi, plants and humans (compare Figures 1A and 1B). Subsequent processing of the core
oligosaccharide, however, differs significantly in yeast and involves the addition of several mannose sugars. This step
is catalyzed by mannosyltransferases residing in the Golgi (e.g. OCH1, MNT1, MNN1, etc.), which sequentially add
mannose sugars to the core oligosaccharide. The resulting structure is undesirable for the production of humanoid
proteins and it is thus desirable to reduce or eliminate mannosyl transferase activity. Mutants of S. cerevisiae deficient
in mannosyl transferase activity (e.g. och1 or mnn9 mutants) have been shown to be non-lethal and display a reduced
mannose content in the oligosaccharide of yeast glycoproteins. Other oligosaccharide processing enzymes, such as
mannosylphosphate transferase, may also have to be eliminated depending on the host's particular endogenous
glycosylation pattern. After reducing undesired endogenous glycosylation reactions, the formation of complex N-glycans
has to be engineered into the host system. This requires the stable expression of several enzymes and sugar-nucleotide
transporters. Moreover, one has to locate these enzymes in a fashion such that a sequential processing of the maturing
glycosylation structure is ensured.

Target Glycoproteins 

[0042] The methods described herein are useful for producing glycoproteins, especially glycoproteins used
therapeutically in humans. Such therapeutic proteins are typically administered by injection, orally, by the pulmonary
route, or by other means.

[0043] Examples of suitable target glycoproteins include, without limitation: erythropoietin, cytokines such as
interferon-α, interferon-β, interferon-γ, interferon-ω, and granulocyte-CSF, coagulation factors such as factor VIII, factor IX,
and human protein C, soluble IgE receptor α-chain, IgG, IgM, urokinase, chymase, and urea trypsin inhibitor,
IGF-binding protein, epidermal growth factor, growth hormone-releasing factor, annexin V fusion protein, angiostatin,
vascular endothelial growth factor-2, myeloid progenitor inhibitory factor-1, and osteoprotegerin.

Method for Producing Glycoproteins Comprising the N-glycan Man5GlcNAc2

[0044] The first step involves the selection or creation of a lower eukaryote that is able to produce a specific precursor
structure of Man5GlcNAc2, which is able to accept in vivo GlcNAc by the action of a GlcNAc transferase I. This step
requires the formation of a particular isomeric structure of Man5GlcNAc2. This structure has to be formed within the
cell at a high yield (in excess of 30%) since all subsequent manipulations are contingent on the presence of this
precursor. Man5GlcNAc2 structures are necessary for complex N-glycan formation; however, their presence is by no means
sufficient, since Man5GlcNAc2 may occur in different isomeric forms, which may or may not serve as a substrate for
GlcNAc transferase I. Most glycosylation reactions are not complete and thus a particular protein generally contains
a range of different carbohydrate structures (i.e. glycoforms) on its surface. The mere presence of trace amounts (less
than 5%) of a particular structure like Man5GlcNAc2 is of little practical relevance. It is the formation of a particular,
GlcNAc transferase I accepting intermediate (Structure I) in high yield (above 30%), which is required. The formation
of this intermediate is necessary and subsequently allows for the in vivo synthesis of complex N-glycans.

[0045] One can select such lower eukaryotes from nature or alternatively genetically engineer existing fungi or other
lower eukaryotes to provide the structure in vivo. No lower eukaryote has been shown to provide such structures in
vivo in excess of 1.8% of the total N-glycans (Maras et al., 1997), so a genetically engineered organism is preferred.
Methods such as those described in U.S. Patent No. 5,595,900 may be used to identify the absence or presence of
particular glycosyltransferases, mannosidases and sugar nucleotide transporters in a target organism of interest.

Inactivation of Fungal Glycosylation Enzymes such as 1,2-α-mannosidase


[0046] The method described herein may be used to engineer the glycosylation pattern of a wide range of lower
eukaryotes (e.g. Hansenula polymorpha, Pichia stipitis, Pichia methanolica, Pichia sp., Kluyveromyces sp., Candida
albicans, Aspergillus nidulans, Trichoderma reesei, etc.). Pichia pastoris is used to exemplify the required manipulation
steps. Similar to other lower eukaryotes, P. pastoris processes Man9GlcNAc2 structures in the ER with a
1,2-α-mannosidase to yield Man8GlcNAc2. Through the action of several mannosyltransferases, this structure is then converted
to hypermannosylated structures (Man>9GlcNAc2), also known as mannans. In addition, it has been found that
P. pastoris is able to add non-terminal phosphate groups, through the action of mannosylphosphate transferases, to the
carbohydrate structure. This is contrary to the reactions found in mammalian cells, which involve the removal of
mannose sugars as opposed to their addition. It is of particular importance to eliminate the ability of the fungus to
hypermannosylate the existing Man8GlcNAc2 structure. This can be achieved by either selecting for a fungus that does not
hypermannosylate, or by genetically engineering such a fungus.

[0047] Genes that are involved in this process have been identified in Pichia pastoris, and by creating mutations in
these genes one is able to reduce the production of "undesirable" glycoforms. Such genes can be identified by homology
to existing mannosyltransferases (e.g. OCH1, MNN4, MNN6, MNN1) found in other lower eukaryotes such as
C. albicans, Pichia angusta or S. cerevisiae, or by mutagenizing the host strain and selecting for a phenotype with reduced
mannosylation. Based on homologies amongst known mannosyltransferases and mannosylphosphate transferases,
one may either design PCR primers, examples of which are shown in Table 2, or use genes or gene fragments encoding
such enzymes as probes to identify homologues in DNA libraries of the target organism. Alternatively, one may be able
to complement particular phenotypes in related organisms. For example, in order to obtain the gene or genes encoding
1,6-mannosyltransferase activity in P. pastoris, one would carry out the following steps. OCH1 mutants of S. cerevisiae
are temperature sensitive and are slow growers at elevated temperatures. One can thus identify functional homologues
of OCH1 in P. pastoris by complementing an OCH1 mutant of S. cerevisiae with a P. pastoris DNA or cDNA library.
Such mutants of S. cerevisiae may be found at http://genome-www.stanford.edu/Saccharomyces/ and are commercially
available at http://www.resgen.com/products/YEASTD.php3. Mutants that display a normal growth phenotype at
elevated temperature, after having been transformed with a P. pastoris DNA library, are likely to carry an OCH1 homologue
of P. pastoris. Such a library can be created by partially digesting chromosomal DNA of P. pastoris with a suitable
restriction enzyme and, after inactivating the restriction enzyme, ligating the digested DNA into a suitable vector which
has been digested with a compatible restriction enzyme. Suitable vectors are pRS314, a low copy (CEN6/ARS4)
plasmid based on pBluescript containing the Trp1 marker (Sikorski, R. S., and Hieter, P., 1989, Genetics 122, pg. 19-27), or
pFL44S, a high copy (2μ) plasmid based on a modified pUC19 containing the URA3 marker (Bonneaud, N., et al.,
1991, Yeast 7, pg. 609-615). Such vectors are commonly used by academic researchers, and similar vectors are available
from a number of different vendors such as Invitrogen (Carlsbad, CA), Pharmacia (Piscataway, NJ), and New England
Biolabs (Beverly, MA). Examples are pYES/GS, a 2μ origin of replication based yeast expression plasmid from Invitrogen,
or the YEp24 cloning vehicle from New England Biolabs. After ligation of the chromosomal DNA and the vector, one may
transform the DNA library into a strain of S. cerevisiae with a specific mutation and select for the correction of the
corresponding phenotype. After sub-cloning and sequencing the DNA fragment that is able to restore the wild-type
phenotype, one may use this fragment to eliminate the activity of the gene product encoded by OCH1 in P. pastoris.
[0048] Alternatively, if the entire genomic sequence of a particular fungus of interest is known, one may identify such
genes simply by searching publicly available DNA databases, which are available from several sources such as NCBI,
Swissprot etc. For example, by searching a given genomic sequence or database with a known 1,6-mannosyltransferase
gene (OCH1) from S. cerevisiae, one is able to identify genes of high homology in such a genome, which with a high
degree of certainty encode a protein that has 1,6-mannosyltransferase activity. Homologues to several known
mannosyltransferases from S. cerevisiae have been identified in P. pastoris using either one of these approaches. These genes
have similar functions to genes involved in the mannosylation of proteins in S. cerevisiae, and thus their deletion may
be used to manipulate the glycosylation pattern in P. pastoris or any other fungus with similar glycosylation pathways.
[0049] The creation of gene knock-outs, once a given target gene sequence has been determined, is a well-established
technique in the yeast and fungal molecular biology community, and can be carried out by anyone of ordinary
skill in the art (R. Rothstein, (1991) Methods in Enzymology, vol. 194, p. 281). In fact, the choice of a host organism
may be influenced by the availability of good transformation and gene disruption techniques for such a host. If several
mannosyltransferases have to be knocked out, the method developed by Alani and Kleckner allows for the repeated
use of the URA3 marker to sequentially eliminate all undesirable endogenous mannosyltransferase activity. This
technique has been refined by others but basically involves the use of two repeated DNA sequences flanking a
counter-selectable marker. For example, URA3 may be used as a marker to ensure the selection of transformants that have
integrated a construct. By flanking the URA3 marker with direct repeats, one may first select for transformants that have
integrated the construct and have thus disrupted the target gene. After isolation of the transformants, and their
characterization, one may counter-select in a second round for those that are resistant to 5-FOA. Colonies that are able to
survive on plates containing 5-FOA have lost the URA3 marker again through a crossover event involving the repeats
mentioned earlier. This approach thus allows for the repeated use of the same marker and facilitates the disruption of
multiple genes without requiring additional markers.
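
The marker-recycling logic described above can be illustrated with a small bookkeeping sketch. It is not a laboratory protocol: the `Strain` class, its method names, and the assumption that the host starts out ura3-minus are hypothetical and serve only to show why a single URA3 marker suffices for several sequential disruptions.

```python
from dataclasses import dataclass, field


@dataclass
class Strain:
    """Toy genotype record for a ura3-minus host (hypothetical model)."""
    disrupted: set = field(default_factory=set)  # genes knocked out so far
    has_ura3: bool = False  # True while a URA3 cassette is integrated

    def disrupt(self, gene: str) -> None:
        # Integration of a repeat-URA3-repeat cassette is selected on medium
        # lacking uracil, which is only informative in a ura3-minus strain.
        if self.has_ura3:
            raise RuntimeError("URA3 still present: counter-select on 5-FOA first")
        self.disrupted.add(gene)
        self.has_ura3 = True

    def counter_select_5foa(self) -> None:
        # Survivors on 5-FOA have lost URA3 through a crossover between the
        # flanking direct repeats; the gene disruption itself is retained.
        if not self.has_ura3:
            raise RuntimeError("no URA3 marker to remove")
        self.has_ura3 = False


def knock_out_all(targets):
    """Disrupt each target in turn, recycling the same marker every round."""
    strain = Strain()
    for gene in targets:
        strain.disrupt(gene)
        strain.counter_select_5foa()
    return strain


final = knock_out_all(["OCH1", "MNN4", "MNN6"])
print(sorted(final.disrupted), "URA3 present:", final.has_ura3)
```

Each round ends with the strain ura3-minus again, so the next disruption can reuse the same selection.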

[0050] Eliminating specific mannosyltransferases, such as 1,6-mannosyltransferase (OCH1) and mannosylphosphate
transferases (MNN4, MNN6, or genes complementing lbd mutants) in P. pastoris, allows for the creation of engineered
strains of this organism which synthesize primarily Man8GlcNAc2 and thus can be used to further modify the
glycosylation pattern to more closely resemble more complex human glycoform structures. A preferred embodiment of this
method utilizes known DNA sequences, encoding known biochemical glycosylation activities, to eliminate similar or
identical biochemical functions in P. pastoris, such that the glycosylation structure of the resulting genetically altered
P. pastoris strain is modified.


Table 2.

PCR primer A                 PCR primer B            Target Gene(s) in P. pastoris   Homologues
ATGGCGAAGGCAGATGGCAGT        TTAGTCCTTCCAACTTCCTTC   1,6-mannosyltransferase         OCH1 S. cerevisiae, Pichia albicans
TAYTGGMGNGTNGARCYNGAYATHAA   GCRTCNCCCCANCKYTCRTA    1,2-mannosyltransferases        KTR/KRE family, S. cerevisiae

Legend: M = A or C, R = A or G, W = A or T, S = C or G, Y = C or T, K = G or T, V = A or C or G, H = A or C or T,
D = A or G or T, B = C or G or T, N = G or A or T or C.
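
The degenerate base codes in the legend of Table 2 follow the standard IUPAC nucleotide alphabet. As a minimal sketch of how such a primer can be handled computationally, the snippet below counts how many distinct sequences a degenerate primer represents and locates its possible annealing sites on a template. The example primer `ACNGGRTAY` and the template string are made up for illustration and are not taken from Table 2.

```python
import re
from math import prod

# Standard IUPAC nucleotide codes, matching the legend of Table 2.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "M": "AC", "R": "AG", "W": "AT", "S": "CG", "Y": "CT", "K": "GT",
    "V": "ACG", "H": "ACT", "D": "AGT", "B": "CGT", "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct unambiguous sequences encoded by the primer."""
    return prod(len(IUPAC[base]) for base in primer.upper())


def to_regex(primer: str) -> "re.Pattern[str]":
    """Translate a degenerate primer into a regular expression."""
    parts = []
    for base in primer.upper():
        allowed = IUPAC[base]
        parts.append(allowed if len(allowed) == 1 else f"[{allowed}]")
    return re.compile("".join(parts))


def find_sites(primer: str, template: str) -> list:
    """Return every 0-based position where the primer matches the template."""
    pattern = to_regex(primer)
    template = template.upper()
    last_start = len(template) - len(primer)
    return [i for i in range(last_start + 1) if pattern.match(template, i)]


print(degeneracy("ACNGGRTAY"))                    # hypothetical primer
print(find_sites("GGRTAY", "TTGGATACCGGGTATAA"))  # hypothetical template
```

Only the forward strand is scanned here; a reverse primer would be searched against the reverse complement of the template.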



Incorporation of a Mannosidase into the Genetically Engineered Host 

[0051] The process described herein enables one to obtain such a structure in high yield for the purpose of modifying
it to yield complex N-glycans. A successful scheme to obtain suitable Man5GlcNAc2 structures must involve two parallel
approaches: (1) reducing endogenous mannosyltransferase activity and (2) removing 1,2-α-mannose by mannosidases
to yield high levels of suitable Man5GlcNAc2 structures. What distinguishes this method from the prior art is that it
deals directly with those two issues. As the work of Chiba and coworkers demonstrates, one can reduce Man8GlcNAc2
structures to a Man5GlcNAc2 isomer in S. cerevisiae by engineering the presence of a fungal mannosidase from
A. saitoi into the ER. The shortcomings of their approach are twofold: (1) insufficient amounts of Man5GlcNAc2 are formed
in the extra-cellular glycoprotein fraction (10%) and (2) it is not clear that the in vivo formed Man5GlcNAc2 structure in
fact is able to accept GlcNAc by action of GlcNAc transferase I. If several glycosylation sites are present in a desired
protein, the probability (P) of obtaining such a protein in a correct form follows the relationship P = (F)^n, where n equals
the number of glycosylation sites and F equals the fraction of desired glycoforms. With F = 10%, a glycoprotein with three
glycosylation sites would have a 0.1% chance (0.1^3 = 0.001) of providing the appropriate precursors for complex and
hybrid N-glycan processing on all of its glycosylation sites, which limits the commercial value of such an approach.
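
The relationship P = (F)^n can be checked numerically. The sketch below is only an illustration of the formula as stated, which implicitly treats the glycosylation sites as independent; the function names and the extra F values other than the 10% figure cited for the prior art are assumptions chosen for the example.

```python
def fraction_fully_correct(f: float, n_sites: int) -> float:
    """P = F**n: fraction of protein molecules carrying the desired
    glycoform on all n sites, given a per-site fraction F."""
    if not 0.0 <= f <= 1.0:
        raise ValueError("F must be a fraction between 0 and 1")
    if n_sites < 0:
        raise ValueError("number of sites must be non-negative")
    return f ** n_sites


def required_site_fraction(p_target: float, n_sites: int) -> float:
    """Inverse relationship: per-site fraction F needed to reach P."""
    if not 0.0 < p_target <= 1.0 or n_sites < 1:
        raise ValueError("need 0 < P <= 1 and at least one site")
    return p_target ** (1.0 / n_sites)


# 0.10 is the prior-art yield cited in the text; the other values are
# illustrative per-site fractions.
for f in (0.10, 0.27, 0.50, 0.90):
    p = fraction_fully_correct(f, 3)
    print(f"F = {f:.0%}, n = 3 sites -> P = {p:.2%}")
```

Because P falls off as a power of n, even moderate per-site yields translate into small fractions of fully correct protein once a glycoprotein carries several sites.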

[0052] Most enzymes that are active in the ER and Golgi apparatus of S. cerevisiae have pH optima that are between
6.5 and 7.5 (see Table 3). All previous approaches to reduce mannosylation by the action of recombinant mannosidases
have concentrated on enzymes that have a pH optimum around pH 5.0 (Martinet et al., 1998, and Chiba et al., 1998),
even though the activity of these enzymes is reduced to less than 10% at pH 7.0 and thus most likely provides insufficient
activity at their point of use, the ER and early Golgi of P. pastoris and S. cerevisiae. A preferred process utilizes an
α-mannosidase in vivo, where the pH optimum of the mannosidase is within 1.4 pH units of the average pH optimum of
other representative marker enzymes localized in the same organelle(s). The pH optimum of the enzyme to be targeted
to a specific organelle should be matched with the pH optimum of other enzymes found in the same organelle, such
that the maximum activity per unit enzyme is obtained. Table 3 summarizes the activity of mannosidases from various
sources and their respective pH optima. Table 4 summarizes their location.



Table 3. Mannosidases and their pH optimum.

Source                     Enzyme                         pH optimum  Reference
Aspergillus saitoi         1,2-α-mannosidase              5.0         Ichishima et al., 1999 Biochem. J. 339(Pt 3):589-597
Trichoderma reesei         1,2-α-mannosidase              5.0         Maras et al., 2000 J. Biotechnol. 77(2-3):255-263
Penicillium citrinum       1,2-α-D-mannosidase            5.0         Yoshida et al., 1993 Biochem. J. 290(Pt 2):349-354
Aspergillus nidulans       1,2-α-mannosidase              6.0         Eades and Hintz, 2000
Homo sapiens IA (Golgi)    1,2-α-mannosidase              6.0
Homo sapiens IB (Golgi)    1,2-α-mannosidase              6.0
Lepidopteran insect cells  Type I 1,2-α-Man6-mannosidase  6.0         Ren et al., 1995 Biochem. 34(8):2489-2495
Homo sapiens               α-D-mannosidase                6.0         Chandrasekaran et al., 1984 Cancer Res. 44(9):4059-68
Xanthomonas manihotis      1,2,3-α-mannosidase            6.0
Mouse IB (Golgi)           1,2-α-mannosidase              6.5         Schneikert and Herscovics, 1994 Glycobiology 4(4):445-50
Bacillus sp. (secreted)    1,2-α-D-mannosidase            7.0         Maruyama et al., 1994 Carbohydrate Res. 251:89-98



[0053] When one attempts to trim high mannose structures to yield Man5GlcNAc2 in the ER or the Golgi apparatus
of S. cerevisiae, one may choose any enzyme or combination of enzymes that (1) has/have a sufficiently close pH
optimum (i.e. between pH 5.2 and pH 7.8), and (2) is/are known to generate, alone or in concert, the specific isomeric
Man5GlcNAc2 structure required to accept subsequent addition of GlcNAc by GnT I. Any enzyme or combination of
enzymes that has/have been shown to generate a structure that can be converted to GlcNAcMan5GlcNAc2 by GnT I in vitro
would constitute an appropriate choice. This knowledge may be obtained from the scientific literature or experimentally,
by determining that a potential mannosidase can convert Man8GlcNAc2-PA to Man5GlcNAc2-PA and then testing whether
the obtained Man5GlcNAc2-PA structure can serve as a substrate for GnT I and UDP-GlcNAc to give GlcNAcMan5GlcNAc2
in vitro. For example, mannosidase IA from a human or murine source would be an appropriate choice.

1,2-Mannosidase Activity in the ER and Golgi 

[0054] Previous approaches to reduce mannosylation by the action of cloned exogenous mannosidases have failed
to yield glycoproteins having a sufficient fraction (e.g. >27 mole %) of N-glycans having the structure Man5GlcNAc2
(Martinet et al., 1998, and Chiba et al., 1998). These enzymes should function efficiently in the ER or Golgi apparatus to
be effective in converting nascent glycoproteins. Whereas the two mannosidases utilized in the prior art (from A. saitoi
and T. reesei) have pH optima of 5.0, most enzymes that are active in the ER and Golgi apparatus of yeast (e.g.
S. cerevisiae) have pH optima that are between 6.5 and 7.5 (see Table 3). Since the glycosylation of proteins is a highly
evolved and efficient process, it can be concluded that the internal pH of the ER and the Golgi is also in the range of
about 6-8. At pH 7.0, the activity of the mannosidases used in the prior art is reduced to less than 10%, which is
insufficient for the efficient production of Man5GlcNAc2 in vivo.



Table 4. Cellular location and pH optima of various glycosylation-related enzymes of S. cerevisiae.

Gene    Activity                   Location  pH optimum  Author(s)
Ktr1    α-1,2 mannosyltransferase  Golgi     7.0         Romero et al., 1997 Biochem. J. 321(Pt 2):289-295
Mns1    α-1,2-mannosidase          ER        6.5
CWH41   glucosidase I              ER        6.8
-       mannosyltransferase        Golgi     7-8         Lehle and Tanner, 1974 Biochim. Biophys. Acta 350(1):225-235
Kre2    α-1,2 mannosyltransferase  Golgi     6.5-9.0     Romero et al., 1997



[0055] The α-1,2-mannosidase enzyme should have optimal activity at a pH between 5.1 and 8.0. In a preferred
embodiment the enzyme has an optimal activity at a pH between 5.9 and 7.5. The optimal pH may be determined
under in vitro assay conditions. Preferred mannosidases include those listed in Table 3 having appropriate pH optima,
e.g. Aspergillus nidulans, Homo sapiens IA (Golgi), Homo sapiens IB (Golgi), Lepidopteran insect cells (IPLB-SF21AE),
Homo sapiens, mouse IB (Golgi), and Xanthomonas manihotis. In a preferred embodiment, a single cloned
mannosidase gene is expressed in the host organism. However, in some cases it may be desirable to express several different
mannosidase genes, or several copies of one particular gene, in order to achieve adequate production of Man5GlcNAc2.
In cases where multiple genes are used, the encoded mannosidases should all have pH optima within the preferred
range of 5.1 to 8.0, or especially between 5.9 and 7.5. In an especially preferred embodiment mannosidase activity is
targeted to the ER or cis Golgi, where the early reactions of glycosylation occur.
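
The pH-based selection described above amounts to filtering the candidates of Table 3 against a window of acceptable optima. The following sketch uses the pH values listed in Table 3; the data structure and the helper name `within` are hypothetical, and the filter considers pH optimum only, not whether an enzyme yields the GnT I-accepting Man5GlcNAc2 isomer.

```python
from typing import NamedTuple


class Mannosidase(NamedTuple):
    source: str
    ph_optimum: float


# pH optima as listed in Table 3.
TABLE_3 = [
    Mannosidase("Aspergillus saitoi", 5.0),
    Mannosidase("Trichoderma reesei", 5.0),
    Mannosidase("Penicillium citrinum", 5.0),
    Mannosidase("Aspergillus nidulans", 6.0),
    Mannosidase("Homo sapiens IA (Golgi)", 6.0),
    Mannosidase("Homo sapiens IB (Golgi)", 6.0),
    Mannosidase("Lepidopteran insect cells", 6.0),
    Mannosidase("Homo sapiens", 6.0),
    Mannosidase("Xanthomonas manihotis", 6.0),
    Mannosidase("Mouse IB (Golgi)", 6.5),
    Mannosidase("Bacillus sp. (secreted)", 7.0),
]


def within(candidates, low: float, high: float) -> list:
    """Sources whose pH optimum lies in the closed interval [low, high]."""
    return [m.source for m in candidates if low <= m.ph_optimum <= high]


acceptable = within(TABLE_3, 5.1, 8.0)  # broad range given in the text
preferred = within(TABLE_3, 5.9, 7.5)   # preferred range given in the text
rejected = [m.source for m in TABLE_3 if m.source not in acceptable]

print("acceptable:", acceptable)
print("preferred: ", preferred)
print("rejected:  ", rejected)
```

With these values the three enzymes with an optimum of pH 5.0, including the two used in the prior art, fall outside both windows.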

Formation of complex N-glycans 

[0056] A second step of the process involves the sequential addition of sugars to the nascent carbohydrate structure
by engineering the expression of glycosyltransferases into the Golgi apparatus. This process first requires the functional
expression of GnT I in the early or medial Golgi apparatus as well as ensuring a sufficient supply of UDP-GlcNAc.

Integration Sites 

[0057] Since the ultimate goal of this genetic engineering effort is a robust protein production strain that is able to
perform well in an industrial fermentation process, the integration of multiple genes into the fungal chromosome involves
careful planning. The engineered strain will most likely have to be transformed with a range of different genes, and these
genes will have to be transformed in a stable fashion to ensure that the desired activity is maintained throughout the
fermentation process. Any combination of the following enzyme activities will have to be engineered into the fungal
protein expression host: sialyltransferases, mannosidases, fucosyltransferases, galactosyltransferases,
glucosyltransferases, GlcNAc transferases, ER and Golgi specific transporters (e.g. sym and antiport transporters for UDP-galactose
and other precursors), other enzymes involved in the processing of oligosaccharides, and enzymes involved in the
synthesis of activated oligosaccharide precursors such as UDP-galactose and CMP-N-acetylneuraminic acid. At the same
time, a number of genes which encode enzymes known to be characteristic of non-human glycosylation reactions will
have to be deleted.

Targeting of glycosyltransferases to specific organelles: 

[0058] Glycosyltransferases and mannosidases line the inner (luminal) surface of the ER and Golgi apparatus and
thereby provide a "catalytic" surface that allows for the sequential processing of glycoproteins as they proceed through
the ER and Golgi network. In fact, the multiple compartments of the cis, medial, and trans Golgi and the trans-Golgi
Network (TGN) provide the different localities in which the ordered sequence of glycosylation reactions can take place.
As a glycoprotein proceeds from synthesis in the ER to full maturation in the late Golgi or TGN, it is sequentially exposed
to different glycosidases, mannosidases and glycosyltransferases such that a specific carbohydrate structure may be
synthesized. Much work has been dedicated to revealing the exact mechanism by which these enzymes are retained
and anchored to their respective organelle. The evolving picture is complex, but evidence suggests that the stem region,
membrane spanning region and cytoplasmic tail individually or in concert direct enzymes to the membrane of individual
organelles and thereby localize the associated catalytic domain to that locus.

[0059] Targeting sequences are well known and described in the scientific literature and public databases, as
discussed in more detail below with respect to libraries for selection of targeting sequences and targeted enzymes.

Method for Producing a Library to Produce Modified Glycosylation Pathways 


[0060] A library including at least two genes encoding exogenous glycosylation enzymes is transformed into the
host organism, producing a genetically mixed population. Transformants having the desired glycosylation phenotypes
are then selected from the mixed population. In a preferred embodiment, the host organism is a yeast, especially
P. pastoris, and the host glycosylation pathway is modified by the operative expression of one or more human or animal
glycosylation enzymes, yielding protein N-glycans similar or identical to human glycoforms. In an especially preferred
embodiment, the DNA library includes genetic constructs encoding fusions of glycosylation enzymes with targeting
sequences for various cellular loci involved in glycosylation, especially the ER, cis Golgi, medial Golgi, or trans Golgi.
[0061] Examples of modifications to glycosylation which can be effected using the method are: (1) engineering a
eukaryotic microorganism to trim mannose residues from Man8GlcNAc2 to yield Man5GlcNAc2 as a protein N-glycan; (2)
engineering a eukaryotic microorganism to add an N-acetylglucosamine (GlcNAc) residue to Man5GlcNAc2 by action
of GlcNAc transferase I; (3) engineering a eukaryotic microorganism to functionally express an enzyme such as an
N-acetylglucosamine transferase (GnT I, GnT II, GnT III, GnT IV, GnT V, GnT VI), mannosidase II, fucosyltransferase,
galactosyl transferase (GalT) or sialyltransferases (ST).

[0062] By repeating the method, increasingly complex glycosylation pathways can be engineered into the target
microorganism. In one preferred embodiment, the host organism is transformed two or more times with DNA libraries
including sequences encoding glycosylation activities. Selection of desired phenotypes may be performed after each
round of transformation or alternatively after several transformations have occurred. Complex glycosylation pathways
can be rapidly engineered in this manner.

DNA Libraries

[0063] It is necessary to assemble a DNA library including at least two exogenous genes encoding glycosylation
enzymes. In addition to the open reading frame sequences, it is generally preferable to provide each library construct
with such promoters, transcription terminators, enhancers, ribosome binding sites, and other functional sequences as
may be necessary to ensure effective transcription and translation of the genes upon transformation into the host
organism. Where the host is Pichia pastoris, suitable promoters include, for example, the AOX1, AOX2, DAS, and P40
promoters. It is also preferable to provide each construct with at least one selectable marker, such as a gene to impart
drug resistance or to complement a host metabolic lesion. The presence of the marker is useful in the subsequent
selection of transformants; for example, in yeast the URA3, HIS4, SUC2, G418, BLA, or SH BLE genes may be used.

[0064] In some cases the library may be assembled directly from existing or wild-type genes. In a preferred
embodiment, however, the DNA library is assembled from the fusion of two or more sub-libraries. By the in-frame ligation of
the sub-libraries, it is possible to create a large number of novel genetic constructs encoding useful targeted
glycosylation activities. For example, one useful sub-library includes DNA sequences encoding any combination of enzymes
such as sialyltransferases, mannosidases, fucosyltransferases, galactosyltransferases, glucosyltransferases, and
GlcNAc transferases. Preferably, the enzymes are of human origin, although other mammalian, animal, or fungal enzymes
are also useful. In a preferred embodiment, genes are truncated to give fragments encoding the catalytic domains of
the enzymes. By removing endogenous targeting sequences, the enzymes may then be redirected and expressed in
other cellular loci. The choice of such catalytic domains may be guided by the knowledge of the particular environment
in which the catalytic domain is subsequently to be active. For example, if a particular glycosylation enzyme is to be
active in the late Golgi, and all known enzymes of the host organism in the late Golgi have a certain pH optimum, then
a catalytic domain is chosen which exhibits adequate activity at that pH.

[0065] Another useful sub-library includes DNA sequences encoding signal peptides that result in localization of a
protein to a particular location within the ER, Golgi, or trans Golgi network. These signal sequences may be selected
from the host organism as well as from other related or unrelated organisms. Membrane-bound proteins of the ER or
Golgi typically may include, for example, N-terminal sequences encoding a cytosolic tail (ct), a transmembrane domain
(tmd), and a stem region (sr). The ct, tmd, and sr sequences are sufficient individually or in combination to anchor
proteins to the inner (lumenal) membrane of the organelle. Accordingly, a preferred embodiment of the sub-library of
signal sequences includes ct, tmd, and/or sr sequences from these proteins. In some cases it is desirable to provide
the sub-library with varying lengths of sr sequence. This may be accomplished by PCR using primers that bind to the
5' end of the DNA encoding the cytosolic region and employing a series of opposing primers that bind to various parts
of the stem region. Still other useful sources of signal sequences include retrieval signal peptides, e.g. the tetrapeptides
HDEL or KDEL, which are typically found at the C-terminus of proteins that are transported retrograde into the ER or
Golgi. Still other sources of signal sequences include (a) type II membrane proteins, (b) the enzymes listed in Table
3, (c) membrane spanning nucleotide sugar transporters that are localized in the Golgi, and (d) sequences referenced
in Table 5.



Table 5. Sources of useful compartmental targeting sequences

Gene or Sequence      Organism        Function                   Location of Gene Product
MnsI                  S. cerevisiae   α-1,2-mannosidase          ER
OCH1                  S. cerevisiae   1,6-mannosyltransferase    Golgi (cis)
MNN2                  S. cerevisiae   1,2-mannosyltransferase    Golgi (medial)
MNN1                  S. cerevisiae   1,3-mannosyltransferase    Golgi (trans)
OCH1                  P. pastoris     1,6-mannosyltransferase    Golgi (cis)
2,6 ST                H. sapiens      2,6-sialyltransferase      trans Golgi network
UDP-Gal T             S. pombe        UDP-Gal transporter        Golgi
Mnt1                  S. cerevisiae   1,2-mannosyltransferase    Golgi (cis)
HDEL at C-terminus    S. cerevisiae   retrieval signal           ER



[0066] In any case, it is highly preferred that signal sequences are selected which are appropriate for the enzymatic
activity or activities which are to be engineered into the host. For example, in developing a modified microorganism
capable of terminal sialylation of nascent N-glycans, a process which occurs in the late Golgi in humans, it is desirable
to utilize a sub-library of signal sequences derived from late Golgi proteins. Similarly, the trimming of Man8GlcNAc2 by
an α-1,2-mannosidase to give Man5GlcNAc2 is an early step in complex N-glycan formation in humans. It is therefore
desirable to have this reaction occur in the ER or early Golgi of an engineered host microorganism. A sub-library
encoding ER and early Golgi retention signals is used.
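
As a rough illustration of matching signal sequences to the compartment where an activity is wanted, the rows of Table 5 can be filtered by the location of the gene product. The sketch below transcribes Table 5; the grouping of "ER" and "Golgi (cis)" as early compartments, and of "Golgi (trans)" and "trans Golgi network" as late compartments, is an assumption made only for this example.

```python
# Rows transcribed from Table 5: (gene or sequence, organism, location of gene product).
TABLE_5 = [
    ("MnsI", "S. cerevisiae", "ER"),
    ("OCH1", "S. cerevisiae", "Golgi (cis)"),
    ("MNN2", "S. cerevisiae", "Golgi (medial)"),
    ("MNN1", "S. cerevisiae", "Golgi (trans)"),
    ("OCH1", "P. pastoris", "Golgi (cis)"),
    ("2,6 ST", "H. sapiens", "trans Golgi network"),
    ("UDP-Gal T", "S. pombe", "Golgi"),
    ("Mnt1", "S. cerevisiae", "Golgi (cis)"),
    ("HDEL at C-terminus", "S. cerevisiae", "ER"),
]


def targeting_sources(locations):
    """Return (gene, organism) pairs whose gene product resides in one of `locations`."""
    wanted = set(locations)
    return [(gene, organism) for gene, organism, loc in TABLE_5 if loc in wanted]


# Assumed groupings, for illustration only.
early = targeting_sources({"ER", "Golgi (cis)"})                   # e.g. for an α-1,2-mannosidase
late = targeting_sources({"Golgi (trans)", "trans Golgi network"})  # e.g. for a sialyltransferase
```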

[0067] In a preferred embodiment, a DNA library is then constructed by the in-frame ligation of a sub-library including
DNA encoding signal sequences with a sub-library including DNA encoding glycosylation enzymes or catalytically
active fragments thereof. The resulting library includes synthetic genes encoding fusion proteins. In some cases it is
desirable to provide a signal sequence at the N-terminus of a fusion protein, or in other cases at the C-terminus. In
some cases signal sequences may be inserted within the open reading frame of an enzyme, provided the protein
structure of individual folded domains is not disrupted.
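
The combinatorial nature of such a library can be sketched as the Cartesian product of the two sub-libraries. The following Python is a simplified model, not a cloning protocol: the sequence names and the short DNA strings are made-up placeholders, and "in-frame" is approximated by requiring each fragment to be a whole number of codons, with the signal sequence placed at the N-terminus.

```python
from itertools import product


def build_fusion_library(signal_sequences, catalytic_domains):
    """Pair every targeting sequence with every catalytic domain.

    Both arguments map a name to a DNA string. A fusion is kept only if both
    fragments are a whole number of codons, so the reading frame is preserved
    across the junction (a deliberately crude stand-in for in-frame ligation).
    """
    library = {}
    for (sig_name, sig_dna), (cat_name, cat_dna) in product(
            signal_sequences.items(), catalytic_domains.items()):
        if len(sig_dna) % 3 or len(cat_dna) % 3:
            continue  # fragment would shift the reading frame
        library[f"{sig_name}::{cat_name}"] = sig_dna + cat_dna
    return library


# Placeholder fragments, for illustration only.
signals = {"sig1": "ATGGCTAAA", "sig2": "ATGCCC"}
domains = {"catA": "GGTGAATAA", "catB": "CCTGATTAA", "catC": "GG"}

library = build_fusion_library(signals, domains)
```

Here two signal fragments and two usable catalytic fragments give four fusion constructs; `catC` is skipped because it is not a whole number of codons.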

[0068] The method is most effective when a DNA library transformed into the host contains a large diversity of
sequences, thereby increasing the probability that at least one transformant will exhibit the desired phenotype.
Accordingly, prior to transformation, a DNA library or a constituent sub-library may be subjected to one or more rounds of
gene shuffling, error-prone PCR, or in vitro mutagenesis.

Transformation

[0069] The DNA library is then transformed into the host organism. In yeast, any convenient method of DNA transfer
may be used, such as electroporation, the lithium chloride method, or the spheroplast method. To produce a stable
strain suitable for high-density fermentation, it is desirable to integrate the DNA library constructs into the host
chromosome. In a preferred embodiment, integration occurs via homologous recombination, using techniques known in
the art. For example, DNA library elements are provided with flanking sequences homologous to sequences of the
host organism. In this manner integration occurs at a defined site in the host genome, without disruption of desirable
or essential genes. In an especially preferred embodiment, library DNA is integrated into the site of an undesired gene
in a host chromosome, effecting the disruption or deletion of the gene. For example, integration into the sites of the
OCH1, MNN1, or MNN4 genes allows the expression of the desired library DNA while preventing the expression of
enzymes involved in yeast hypermannosylation of glycoproteins. In other embodiments, library DNA may be introduced
into the host via a chromosome, plasmid, retroviral vector, or random integration into the host genome. In any case, it
is generally desirable to include with each library DNA construct at least one selectable marker gene to allow ready
selection of host organisms that have been stably transformed. Recyclable marker genes such as ura3, which can be
selected for or against, are especially suitable.

Selection Process

[0070] After transformation of the host strain with the DNA library, transformants displaying the desired glycosylation
phenotype are selected. Selection may be performed in a single step or by a series of phenotypic enrichment and/or
depletion steps using any of a variety of assays or detection methods. Phenotypic characterization may be carried out
manually or using automated high-throughput screening equipment. Commonly a host microorganism displays protein
N-glycans on the cell surface, where various glycoproteins are localized. Accordingly intact cells may be screened for
a desired glycosylation phenotype by exposing the cells to a lectin or antibody that binds specifically to the desired
N-glycan. A wide variety of oligosaccharide-specific lectins are available commercially (EY Laboratories, San Mateo,
CA). Alternatively, antibodies to specific human or animal N-glycans are available commercially or may be produced
using standard techniques. An appropriate lectin or antibody may be conjugated to a reporter molecule, such as a
chromophore, fluorophore, radioisotope, or an enzyme having a chromogenic substrate (Guillen et al., 1998, Proc.
Natl. Acad. Sci. USA 95(14): 7888-7892). Screening may then be performed using analytical methods such as
spectrophotometry, fluorimetry, fluorescence activated cell sorting, or scintillation counting. In other cases, it may be
necessary to analyze isolated glycoproteins or N-glycans from transformed cells. Protein isolation may be carried out by
techniques known in the art. In cases where an isolated N-glycan is required, an enzyme such as
endo-β-N-acetylglucosaminidase (Genzyme Co., Boston, MA) may be used to cleave the N-glycans from glycoproteins. Isolated proteins
or N-glycans may then be analyzed by liquid chromatography (e.g. HPLC), mass spectroscopy, or other suitable means.
U.S. Patent No. 5,595,900 teaches several methods by which cells with desired extracellular carbohydrate structures
may be identified. Prior to selection of a desired transformant, it may be desirable to deplete the transformed population
of cells having undesired phenotypes. For example, when the method is used to engineer a functional mannosidase
activity into cells, the desired transformants will have lower levels of mannose in cellular glycoprotein. Exposing the
transformed population to a lethal radioisotope of mannose in the medium depletes the population of transformants
having the undesired phenotype, i.e. high levels of incorporated mannose. Alternatively, a cytotoxic lectin or antibody,
directed against an undesirable N-glycan, may be used to deplete a transformed population of undesired phenotypes.
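
The two-stage logic of depletion followed by selection can be sketched as a filter and a ranking over per-transformant assay readouts. Everything concrete in this Python sketch is an assumption for illustration: the `Transformant` record, the normalized signal values, and the threshold are hypothetical and do not correspond to any particular lectin, antibody, or instrument.

```python
from dataclasses import dataclass


@dataclass
class Transformant:
    name: str
    desired_signal: float    # binding of a lectin/antibody specific for the desired N-glycan
    undesired_signal: float  # binding of a lectin/antibody specific for an undesired N-glycan


def deplete_then_select(population, max_undesired, top_n):
    """Drop transformants with too much undesired signal (depletion step),
    then keep the `top_n` survivors with the strongest desired signal."""
    survivors = [t for t in population if t.undesired_signal <= max_undesired]
    ranked = sorted(survivors, key=lambda t: t.desired_signal, reverse=True)
    return ranked[:top_n]


# Hypothetical normalized readouts, for illustration only.
population = [
    Transformant("t1", desired_signal=0.90, undesired_signal=0.80),
    Transformant("t2", desired_signal=0.70, undesired_signal=0.10),
    Transformant("t3", desired_signal=0.40, undesired_signal=0.20),
    Transformant("t4", desired_signal=0.95, undesired_signal=0.05),
]

hits = deplete_then_select(population, max_undesired=0.3, top_n=2)
```

In this toy population `t1` is removed in the depletion step despite its strong desired signal, which mirrors the idea that depletion acts before selection.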

Methods for Providing Sugar Nucleotide Precursors to the Golgi Apparatus 

[0071] For a glycosyltransferase to function satisfactorily in the Golgi, it is necessary for the enzyme to be provided
with a sufficient concentration of an appropriate nucleotide sugar, which is the high-energy donor of the sugar moiety
added to a nascent glycoprotein. These nucleotide sugars are provided to the appropriate compartments by expressing
an exogenous gene encoding a sugar nucleotide transporter in the host microorganism. The choice of transporter
enzyme is influenced by the nature of the exogenous glycosyltransferase being used. For example, a GlcNAc
transferase may require a UDP-GlcNAc transporter, a fucosyltransferase may require a GDP-fucose transporter, a
galactosyltransferase may require a UDP-galactose transporter, or a sialyltransferase may require a CMP-sialic acid
transporter.
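
The pairing of glycosyltransferase and transporter given in this paragraph amounts to a simple lookup. The Python sketch below encodes only the four pairings named in the text; the function name and the plain-string enzyme labels are assumptions made for the example.

```python
# Pairings taken from the paragraph above.
TRANSPORTER_FOR = {
    "GlcNAc transferase": "UDP-GlcNAc transporter",
    "fucosyltransferase": "GDP-fucose transporter",
    "galactosyltransferase": "UDP-galactose transporter",
    "sialyltransferase": "CMP-sialic acid transporter",
}


def required_transporters(glycosyltransferases):
    """Return the distinct transporters needed, in first-seen order."""
    needed = []
    for enzyme in glycosyltransferases:
        transporter = TRANSPORTER_FOR[enzyme]  # KeyError for an enzyme not listed above
        if transporter not in needed:
            needed.append(transporter)
    return needed


pathway = ["GlcNAc transferase", "galactosyltransferase",
           "sialyltransferase", "galactosyltransferase"]
transporters = required_transporters(pathway)
```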

[0072] The added transporter protein conveys a nucleotide sugar from the cytosol into the Golgi apparatus, where
the nucleotide sugar may be reacted by the glycosyltransferase, e.g. to elongate an N-glycan. The reaction liberates
a nucleoside diphosphate or monophosphate, e.g. UDP, GDP, or CMP. As accumulation of a nucleoside diphosphate
inhibits the further activity of a glycosyltransferase, it is frequently also desirable to provide an expressed copy of a
gene encoding a nucleotide diphosphatase. The diphosphatase (specific for UDP or GDP as appropriate) hydrolyzes
the diphosphonucleoside to yield a nucleoside monophosphate and inorganic phosphate. The nucleoside
monophosphate does not inhibit the glycosyltransferase and in any case is exported from the Golgi by an endogenous cellular
system. Suitable transporter enzymes, which are typically of mammalian origin, are described below.
[0073] The following items also characterize the invention:

1. A method for producing glycoproteins having carbohydrate structures similar to those produced by human cells
in a lower eukaryote comprising providing a unicellular or multicellular fungal host, which does not express one or
more enzymes involved in production of high mannose structures, and introducing into the host one or more
enzymes for production of a carbohydrate structure selected from the group consisting of Man5GlcNAc2,
Man8GlcNAc2 and Man9GlcNAc2, wherein the enzymes are selected to have optimal activity at the pH of the
location in the host where the carbohydrate structure is produced or which are targeted to a subcellular location
in the host where the enzyme will have optimal activity to produce the carbohydrate structure.

2. The method of item 1 wherein the host is deficient in the activity of one or more enzymes selected from the
group consisting of mannosyltransferases and phosphomannosyltransferases.

3. The method of item 2 wherein the host does not express an enzyme selected from the group consisting of 1,6
mannosyltransferase, 1,3 mannosyltransferase, and 1,2 mannosyltransferase.

4. The method of item 1 wherein the host is selected from the group consisting of Pichia pastoris, Pichia finlandica,
Pichia trehalophila, Pichia kodamae, Pichia membranaefaciens, Pichia opuntiae, Pichia thermotolerans, Pichia
salictaria, Pichia quercuum, Pichia pijperi, Pichia stipitis, Pichia methanolica, Pichia sp., Saccharomyces
cerevisiae, Saccharomyces sp., Hansenula polymorpha, Kluyveromyces sp., Candida albicans, Aspergillus nidulans,
and Trichoderma reesei.

5. The method of item 2 wherein the host is an OCH1 mutant of P. pastoris.

6. The method of item 1 comprising introducing into the host a nucleotide molecule encoding one or more
mannosidases involved in the production of Man5GlcNAc2 from Man8GlcNAc2 or Man9GlcNAc2.

7. The method of item 6 where the at least one mannosidase has a pH optimum within 1.4 pH units of the average
pH optimum of other representative enzymes in the organelle in which the mannosidase is localized, or having
optimal activity at a pH between 5.1 and 8.0.

8. The method of item 7 wherein the mannosidase enzyme has optimal activity at a pH between 5.9 and 7.5.

9. The method of item 8 wherein the mannosidase enzyme is an α1,2-mannosidase derived from mouse, human,
Lepidoptera, Aspergillus nidulans, or Bacillus sp.

10. The method of item 1 comprising providing a host that is able to form Man5GlcNAc2 structures, displaying GnT
I activity and having UDP-Gn transporter activity.

11. The method of item 1 comprising providing a host which has a UDP specific diphosphatase activity.

12. The method of item 1 comprising introducing into the host one or more enzymes selected from the group
consisting of mannosidases, glycosyltransferases and glycosidases, wherein the enzymes are targeted to the
endoplasmic reticulum, the early, medial, late Golgi or the trans Golgi network.

13. The method of item 12 wherein the mannosidase enzyme is predominantly localized in the Golgi apparatus or
the endoplasmic reticulum.

14. The method of item 12 wherein the enzymes are localized by forming a fusion protein between a catalytic
domain of the enzyme and a chimeric localization region encoded by at least one genetic construct formed by the
in-frame ligation of a DNA fragment encoding a cellular targeting signal peptide with a DNA fragment encoding a
glycosylation enzyme or catalytically active fragment thereof.

15. The method of item 14 comprising providing a chimeric localization region from an enzyme selected from the
group of mannosyltransferases, diphosphatases, proteases, GnT I, GnT II, GnT III, GnT IV, GnT V, GnT VI, GalT,
FT, and ST.

16. The method of item 14 providing a catalytic domain encoding a glycosidase or glycosyltransferase selected
from the group consisting of GnT I, GnT II, GnT III, GnT IV, GnT V, GnT VI, GalT, Fucosyltransferase and ST, that
has a pH optimum within 1.4 pH units of the average pH optimum of other representative enzymes in the organelle
in which the enzyme is localized, or having optimal activity at a pH between 5.1 and 8.0.

17. The method of item 1 comprising introducing into the host nucleotide molecules encoding one or more enzymes
selected from the group of nucleoside sugar transporters consisting of UDP-GlcNAc transferase,
UDP-galactosyltransferase, GDP-fucosyltransferase, CMP-sialyltransferase, UDP-GlcNAc transporter, UDP-galactose
transporter, GDP-fucose transporter, CMP-sialic acid transporter, and nucleotide diphosphatases.

18. The method of item 17 comprising genetically engineering the fungal strain to remove UDP or GDP by action
of a diphosphatase.

19. The method of item 1 wherein the glycoprotein includes N-glycans of which greater than 27 mole percent
comprise fewer than six mannose residues.

20. The method of item 1 wherein the glycoprotein comprises one or more sugars selected from the group consisting
of galactose, sialic acid, and fucose.

21. The method of item 1 wherein the glycoprotein comprises at least one oligosaccharide branch comprising the
structure NeuNAc-Gal-GlcNAc-Man.

22. The method of item 1 wherein the glycoprotein comprises N-glycans having fewer than four mannose residues.

23. The method of item 1 wherein subsequent to isolation from the host, the glycoprotein is subjected to at least
one further glycosylation or carboxylation reaction in vitro.

24. The method of item 1 comprising the steps of (a) providing a DNA library comprising at least two genes encoding
exogenous glycosylation enzymes; (b) transforming the host with the library to produce a genetically mixed
population expressing at least two distinct exogenous glycosylation enzymes; and (c) selecting from the population a
host producing the desired glycosylation phenotype.

25. The method of item 24 wherein the host is transformed two or more times with the library prior to the selection
of a desired glycosylation phenotype.

26. The method of item 24 wherein the library comprises at least one wild-type gene encoding a glycosylation
enzyme.

27. The method of item 24 wherein the library comprises at least one synthetic gene encoding a glycosylation
enzyme.

28. The method of item 24 wherein the library comprises at least one gene previously subjected to a technique
selected from the list: gene shuffling, in vitro mutagenesis, and error-prone polymerase chain reaction.

29. The method of item 24 wherein the library comprises at least one genetic construct formed by the in-frame
ligation of a DNA fragment encoding a cellular targeting signal peptide with a DNA fragment encoding a
glycosylation enzyme or catalytically active fragment thereof.

30. The method of item 29 wherein a DNA fragment encodes an activity selected from the group consisting of
mannosidase, UDP-GlcNAc transferase, UDP-galactosyltransferase, and CMP-sialyltransferase and the cellular
targeting signal peptide predominantly localizes the enzyme in an organelle selected from the group consisting
of endoplasmic reticulum, cis Golgi, medial Golgi, and trans Golgi.

31. The method of item 24 wherein the selection comprises the step of analyzing a glycosylated protein or isolated
N-glycan by one or more methods selected from the group consisting of mass spectroscopy, liquid chromatography,
characterizing cells using a fluorescence activated cell sorter, spectrophotometer, fluorimeter, or scintillation
counter, exposing host cells to a lectin or antibody having a specific affinity for a desired oligosaccharide moiety, and
exposing cells to a cytotoxic or radioactive molecule selected from the group consisting of sugars, antibodies, and
lectins.

32. The host produced by the method of any of items 1-31.

33. The glycoprotein produced by the method of any of items 1-31.

34. The library of items 24-30.

Examples

[0074] The use of the above general method may be understood by reference to the following non-limiting examples.
Examples of preferred embodiments are also summarized in Table 6.

Example 1: Engineering of P. pastoris with α-1,2-Mannosidase to produce insulin.

[0075] An α-1,2-mannosidase is required for the trimming of Man8GlcNAc2 to yield Man5GlcNAc2, an essential
intermediate for complex N-glycan formation. An OCH1 mutant of P. pastoris is engineered to express secreted human
interferon-β under the control of an aox promoter. A DNA library is constructed by the in-frame ligation of the catalytic
domain of human mannosidase IB (an α-1,2-mannosidase) with a sub-library including sequences encoding early Golgi
localization peptides. The DNA library is then transformed into the host organism, resulting in a genetically mixed
population wherein individual transformants each express interferon-β as well as a synthetic mannosidase gene from
the library. Individual transformant colonies are cultured and the production of interferon is induced by addition of
methanol. Under these conditions, over 90% of the secreted protein includes interferon-β. Supernatants are purified
to remove salts and low-molecular weight contaminants by C18 silica reversed-phase chromatography. Desired
transformants expressing appropriately targeted, active α-1,2-mannosidase produce interferon-β including N-glycans of the
structure Man5GlcNAc2, which has a reduced molecular mass compared to the interferon of the parent strain. The
purified supernatants including interferon-β are analyzed by MALDI-TOF mass spectroscopy and colonies expressing
the desired form of interferon-β are identified.
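
The size of the mass reduction that such an analysis looks for can be estimated from standard residue masses. The Python sketch below is a back-of-the-envelope calculation, not part of the described procedure: it uses monoisotopic residue masses for hexose and N-acetylhexosamine, treats the glycan as a free oligosaccharide, and ignores adducts and the protein itself.

```python
# Monoisotopic masses in daltons (standard values).
HEXOSE_RESIDUE = 162.0528   # mannose residue, C6H10O5
HEXNAC_RESIDUE = 203.0794   # GlcNAc residue, C8H13NO5
WATER = 18.0106             # one water per free oligosaccharide


def glycan_mass(n_man, n_glcnac=2):
    """Monoisotopic mass of a free Man(n)GlcNAc(m) oligosaccharide."""
    return n_man * HEXOSE_RESIDUE + n_glcnac * HEXNAC_RESIDUE + WATER


man8 = glycan_mass(8)   # Man8GlcNAc2
man5 = glycan_mass(5)   # Man5GlcNAc2
shift_per_glycan = man8 - man5   # loss of three mannose residues, about 486.16 Da
```

Each N-glycan trimmed from Man8GlcNAc2 to Man5GlcNAc2 therefore lowers the mass by three hexose residues; for an intact glycoprotein measured at lower resolution, average rather than monoisotopic masses would be the more appropriate choice.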

Example 2: Engineering of Strain to express GlcNAc Transferase I

[0076] GlcNAc Transferase I activity is required for the maturation of complex N-glycans. Man5GlcNAc2 may only
be trimmed by mannosidase II, a necessary step in the formation of human glycoforms, after the addition of GlcNAc
to the terminal α-1,3 mannose residue by GlcNAc Transferase I (Schachter, 1991 Glycobiology 1(5): 453-461).
Accordingly a library is prepared including DNA fragments encoding suitably targeted GlcNAc Transferase I genes. The host
organism is a strain, e.g. a yeast, that is deficient in hypermannosylation (e.g. an OCH1 mutant), provides the substrate
UDP-GlcNAc in the Golgi and/or ER, and provides N-glycans of the structure Man5GlcNAc2 in the Golgi and/or ER.
After transformation of the host with the DNA library, the transformants are screened for those having the highest
concentration of terminal GlcNAc on the cell surface, or alternatively for those that secrete the protein having the highest terminal
GlcNAc content. Such a screen is performed using a visual method (e.g. a staining procedure), a specific terminal
GlcNAc binding antibody, or a lectin. Alternatively the desired transformants exhibit reduced binding of certain lectins
specific for terminal mannose residues.

Example 3: Engineering of Strains with a Mannosidase II

[0077] In another example, it is desirable in order to generate a human glycoform in a microorganism to remove the
two remaining terminal mannoses from the structure GlcNAcMan5GlcNAc2 by action of a mannosidase II. A DNA library
including sequences encoding cis and medial Golgi localization signals is fused in-frame to a library encoding
mannosidase II catalytic domains. The host organism is a strain, e.g. a yeast, that is deficient in hypermannosylation (e.g.
an OCH1 mutant) and provides N-glycans having the structure GlcNAcMan5GlcNAc2 in the Golgi and/or ER. After
transformation, organisms having the desired glycosylation phenotype are selected. An in vitro assay is used in one
method. The desired structure GlcNAcMan3GlcNAc2 (but not the undesired GlcNAcMan5GlcNAc2) is a substrate for
the enzyme GlcNAc Transferase II. Accordingly, single colonies may be assayed using this enzyme in vitro in the
presence of the substrate, UDP-GlcNAc. The release of UDP is determined either by HPLC or an enzymatic assay for
UDP. Alternatively radioactively labeled UDP-GlcNAc is used.

[0078] The foregoing in vitro assays are conveniently performed on individual colonies using high-throughput
screening equipment. Alternatively a lectin binding assay is used. In this case the reduced binding of lectins specific for
terminal mannoses allows the selection of transformants having the desired phenotype. For example, Galanthus nivalis
lectin binds specifically to terminal α-1,3-mannose, the concentration of which is reduced in the presence of operatively
expressed mannosidase II activity. In one suitable method, G. nivalis lectin attached to a solid agarose support
(available from Sigma Chemical, St. Louis, MO) is used to deplete the transformed population of cells having high levels of
terminal α-1,3-mannose.

Example 4: Engineering of organisms to express Sialyltransferase

[0079] The enzymes α2,3-sialyltransferase and α2,6-sialyltransferase add terminal sialic acid to galactose residues
in nascent human N-glycans, leading to mature glycoproteins. In humans the reactions occur in the trans Golgi or TGN.
Accordingly a DNA library is constructed by the in-frame fusion of sequences encoding sialyltransferase catalytic
domains with sequences encoding trans Golgi or TGN localization signals. The host organism is a strain, e.g. a yeast,
that is deficient in hypermannosylation (e.g. an OCH1 mutant), which provides N-glycans having terminal galactose
residues in the trans Golgi or TGN, and provides a sufficient concentration of CMP-sialic acid in the trans Golgi or
TGN. Following transformation, transformants having the desired phenotype are selected using a fluorescent antibody
specific for N-glycans having a terminal sialic acid.

Example 5: Method of engineering strains to express UDP-GlcNAc Transporter

[0080] The cDNA of human Golgi UDP-GlcNAc transporter has been cloned by Ishida and coworkers (Ishida, N.,
et al. 1999 J. Biochem. 126(1): 68-77). Guillen and coworkers have cloned the canine kidney Golgi UDP-GlcNAc
transporter by phenotypic correction of a Kluyveromyces lactis mutant deficient in Golgi UDP-GlcNAc transport (Guillen,
E., et al. 1998). Thus a mammalian Golgi UDP-GlcNAc transporter gene has all of the necessary information for the
protein to be expressed and targeted functionally to the Golgi apparatus of yeast.

Example 6: Method of engineering strains to express GDP-Fucose Transporter.

[0081] The rat liver Golgi membrane GDP-fucose transporter has been identified and purified by Puglielli, L. and C.
B. Hirschberg 1999 J. Biol. Chem. 274(50): 35596-35600. The corresponding gene can be identified using standard
techniques, such as N-terminal sequencing and Southern blotting using a degenerate DNA probe. The intact gene can
then be expressed in a host microorganism that also expresses a fucosyltransferase.

Example 7: Method of engineering strains to express UDP-Galactose Transporter

[0082] Human UDP-galactose (UDP-Gal) transporter has been cloned and shown to be active in S. cerevisiae
(Kainuma, M., et al. 1999 Glycobiology 9(2): 133-141). A second human UDP-galactose transporter (hUGT1) has been
cloned and functionally expressed in Chinese Hamster Ovary Cells (Aoki, K., et al. 1999 J. Biochem. 126(5): 940-950).
Likewise Segawa and coworkers have cloned a UDP-galactose transporter from Schizosaccharomyces pombe
(Segawa, H., et al. 1999 FEBS Letters 451(3): 295-298).

CMP-Sialic Acid Transporter

[0083] Human CMP-sialic acid transporter (hCST) has been cloned and expressed in Lec 8 CHO cells by Aoki and
coworkers (1999). Molecular cloning of the hamster CMP-sialic acid transporter has also been achieved (Eckhardt and
Gerardy Schahn 1997 Eur. J. Biochem. 248(1): 187-192). The functional expression of the murine CMP-sialic acid
transporter was achieved in Saccharomyces cerevisiae by Berninsone, P., et al. 1997 J. Biol. Chem. 272(19):
12616-12619.



Table 6. Examples of preferred embodiments of the methods for modifying glycosylation in a eukaryotic microorganism, e.g. Pichia pastoris

Desired Structure: Man5GlcNAc2
  Suitable Catalytic Activities: α-1,2-mannosidase (murine, human, Bacillus sp., A. nidulans)
  Suitable Sources of Localization Sequences: Mns1 (N-terminus, S. cerevisiae); Och1 (N-terminus, S. cerevisiae, P. pastoris); Ktr1, Mnn9, Mnt1 (S. cerevisiae); KDEL, HDEL (C-terminus)
  Suitable Gene Deletions: OCH1, MNN4, MNN6
  Suitable Transporters and/or Phosphatases: none

Desired Structure: GlcNAcMan5GlcNAc2
  Suitable Catalytic Activities: GlcNAc Transferase I (human, murine, rat etc.)
  Suitable Sources of Localization Sequences: Och1 (N-terminus, S. cerevisiae, P. pastoris); KTR1 (N-terminus); KDEL (C-terminus); Mnn1 (N-terminus, S. cerevisiae); Mnt1 (N-terminus, S. cerevisiae); GDPase (N-terminus, S. cerevisiae)
  Suitable Gene Deletions: OCH1, MNN4, MNN6
  Suitable Transporters and/or Phosphatases: UDP-GlcNAc transporter (human, murine, K. lactis); UDPase (human)

Desired Structure: GlcNAcMan3GlcNAc2
  Suitable Catalytic Activities: mannosidase II
  Suitable Sources of Localization Sequences: Ktr1; Mnn1 (N-terminus, S. cerevisiae); Mnt1 (N-terminus, S. cerevisiae); Kre2/Mnt1 (S. cerevisiae); Kre2 (P. pastoris); Ktr1 (S. cerevisiae); Ktr1 (P. pastoris); Mnn1 (S. cerevisiae)
  Suitable Gene Deletions: OCH1, MNN4, MNN6
  Suitable Transporters and/or Phosphatases: UDP-GlcNAc transporter (human, murine, K. lactis); UDPase (human)

Desired Structure: GlcNAc(2-4)Man3GlcNAc2
  Suitable Catalytic Activities: GlcNAc Transferase II, III, IV, V (human, murine)
  Suitable Sources of Localization Sequences: Mnn1 (N-terminus, S. cerevisiae); Mnt1 (N-terminus, S. cerevisiae); Kre2/Mnt1 (S. cerevisiae); Kre2 (P. pastoris); Ktr1 (S. cerevisiae); Ktr1 (P. pastoris); Mnn1 (S. cerevisiae)
  Suitable Gene Deletions: OCH1, MNN4, MNN6
  Suitable Transporters and/or Phosphatases: UDP-GlcNAc transporter (human, murine, K. lactis); UDPase (human)

Desired Structure: Gal(1-4)GlcNAc(2-4)-Man3GlcNAc2
  Suitable Catalytic Activities: β-1,4-Galactosyl transferase (human)
  Suitable Sources of Localization Sequences: Mnn1 (N-terminus, S. cerevisiae); Mnt1 (N-terminus, S. cerevisiae); Kre2/Mnt1 (S. cerevisiae); Kre2 (P. pastoris); Ktr1 (S. cerevisiae); Ktr1 (P. pastoris); Mnn1 (S. cerevisiae)
  Suitable Gene Deletions: OCH1, MNN4, MNN6
  Suitable Transporters and/or Phosphatases: UDP-Galactose transporter (human, S. pombe)

Desired Structure: Gal(1-4)GlcNAc(2-4)-Man3GlcNAc2
  Suitable Catalytic Activities: α-2,6-Sialyltransferase (human); α-2,3-Sialyltransferase
  Suitable Sources of Localization Sequences: KTR1; MNN1 (N-terminus, S. cerevisiae); MNT1 (N-terminus, S. cerevisiae); Kre2/Mnt1 (S. cerevisiae); Kre2 (P. pastoris); Ktr1 (S. cerevisiae); Ktr1 (P. pastoris); MNN1 (S. cerevisiae)
  Suitable Gene Deletions: OCH1, MNN4, MNN6
  Suitable Transporters and/or Phosphatases: CMP-Sialic acid transporter (human)

Table 7: 





DNA and Protein Sequence Resources 


25 


1 


European Bioinformatics Institute (EBI) is a centre for research and services in bioinformatics: http: 






//www.ebi.acuk/ 




2 


Swissprot database: http://www.expasy.ch/spr 




3 


List of known glycosyltransferases and their origin. 


30 


P1.2 (GnTI)EC 2.4.1.101 




4 


human cDNA, Kumar et al (1990) Proc. Natl. Acad. Sci. USA 87:9948-9952 




5 


human gene, Hull et al (1991) Biochem. Biophys. Res. Commun. 176:608-615 




6 


mouse cDNA. Kumar et al (1992) Glycobiology 2:383-393 


35 


7 


mouse gene, Pownall et al (1992) Genomics 12:699-704 




8 


murine gene (5' flanking, non-coding). Yang et al (1994) Glycobiology 5:703-712 




9 


rabbit cDNA, Sarkar et al (1991) Proc. Natl. Acad. Sci. USA 88:234-238 




10 


rat cDNA, Fukada et al (1994) Biosci.Biotechnol. Biochem. 58:200-201 


40 


1,2 (GnT II) EC 2.4.1.143 




11 


human gene, Tan et al (1995) Eur. J. Biochem. 231:317-328 




12 


rat cDNA, D'Agostaro et al (1995) J. Biol. Chem. 270:15211-15221 




13 


pi ,4 (GnT III) EC 2.4.1.144 




14 


human cDNA, lhara et al (1993) J. Biochem. 113:692-698 


45 


15 


murine gene, Bhaumik et al (1995) Gene 164:295-300 




16 


rat cDNA, Nishikawa et al (1992) J. Biol. Chem. 267:18199-18204 




pi .4 (GnT IV) EC 2.4.1.145 


50 


17 


human cDNA, Yoshida et a! (1998) Glycoconjugate Journal 15:1115-1123 


18 


bovine cDNA, Minowa et al., European Patent EP 0 905 232 pi.6 (GnT V) EC 2.4.1.155 




19 


human cDNA, Saito et al (1994) Biochem. Biophys. Res. Commun. 198:318-327 




20 


rat cDNA. Shoreibah et al (1993) J. Biol. Chem. 268:15381-15385 


55 


P1,4 Galactosyltransferase, EC 2.4.1.90 (LacNAc synthetase) EC 2.4.1.22 (lactose synthetase) 


21 


bovine cDNA, D'Agostaro et al (1989) Eur. J. Biochem. 183:211-217 




22 


bovine cDNA (partial), Narimatsu et al (1986) Proc. Natl. Acad. Sci. USA 83:4720-4724 



( 
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Table 7: (continued) 



DNA and Protein Sequence Resources 


P1.4 Galactosyltransferase, EC 2.4.1.90 (LacNAc synthetase) EC 2.4.1.22 (lactose synthetase) 




Knv/in«ft rHMA /nartial\ Mac^ihav & Oasha M 989^ Pfoc Natl Acad. Sci. USA 86:5733-5377 


24 


t>ovine cDNA (5* end), Russo et al (1990) J. Biol. Chem. 265:3324 


25 


chicken cDNA (partial). Ghosh et al (1992) Biochem. Biophys. Res. Commun. 1215-1222 


26 


human cDNA, Masri et al (1988) Biochem. Biophys. Res. Commun. 157:657-663 


27 


human cDNA. (HeLa cells) Watzele & Berger (1990) Nucl. Acids Res. 18:7174 


28 


human cDNA, (partial) Uejima et al (1992) Cancer Res. 52:6158-6163 


29 


human cDNA, (carcinoma) Appert et al (1986) Biochem. Biophys. Res. Commun. 139:163-168 


30 


human gene. Mengle-Gaw et al (1991) Biochem. Biophys. Res. Commun. 176:1269-1276 


31 


murine cDNA, Nakazawa et al (1988) J. Biochem. 104:165-168 


32 


murine cDNA, Shaper et al (1988) J. Biol. Chem. 263:10420-10428 


33 


murine cDNA (novel), Uehara & Muramatsu unpublished 


34 


murine gene, Hollis et al (1989) Biochem. Biophys. Res. Commun. 162:1069-1075


35 


rat protein (partial), Bendiak et al (1993) Eur. J. Biochem. 216:405-417


2,3-Sialyltransferase, (ST3Gal II) (N-linked) (Gal-1,3/4-GlcNAc) EC 2.4.99.6


36 


human cDNA, Kitagawa & Paulson (1993) Biochem. Biophys. Res. Commun. 194:375-382 


37 


rat cDNA, Wen et al (1992) J. Biol. Chem. 267:21011-21019


2,6-Sialyltransferase, (ST6Gal I) EC 2.4.99.1


38 


chicken, Kurosawa et al (1994) Eur. J. Biochem 219:375-381 


39 


human cDNA (partial), Lance et al (1989) Biochem. Biophys. Res. Commun. 164:225-232


40 


human cDNA, Grundmann et al (1990) Nucl. Acids Res. 18:667


41 


human cDNA, Zettlmeisl et al (1992) Patent EP0475354-A/3


42 


human cDNA, Stamenkovic et al (1990) J. Exp. Med. 172:641-643 (CD75)


43 


human cDNA, Bast et al (1992) J. Cell Biol. 116:423-435 


44 


human gene (partial), Wang et al (1993) J. Biol. Chem. 268:4355-4361


45 


human gene (5' flank), Aasheim et al (1993) Eur. J. Biochem. 213:467-475 


46 


human gene (promoter), Aas-Eng et al (1995) Biochim. Biophys. Acta 1261:166-169


47 


mouse cDNA, Hamamoto et al (1993) Bioorg. Med. Chem. 1:141-145 


48 


rat cDNA, Weinstein et al (1987) J. Biol. Chem. 262:17735-17743 


49 


rat cDNA (transcript fragments), Wang et al (1991) Glycobiology 1:25-31, Wang et al (1990) J. Biol. Chem. 265:17849-17853


50 


rat cDNA (5' end), O'Hanlon et al (1989) J. Biol. Chem. 264:17389-17394; Wang et al (1991) Glycobiology 1:25-31


51 


rat gene (promoter), Svensson et al (1990) J. Biol. Chem. 265:20863-20868


52 


rat mRNA (fragments), Wen et al (1992) J. Biol. Chem. 267:2512-2518



[0084] Additional methods and reagents which can be used in the methods for modifying the glycosylation are
described in the literature, such as U.S. Patent No. 5,955,422, U.S. Patent No. 4,775,622, U.S. Patent No. 6,017,743,
U.S. Patent No. 4,925,796, U.S. Patent No. 5,766,910, U.S. Patent No. 5,834,251, U.S. Patent No. 5,910,570, U.S.
Patent No. 5,849,904, U.S. Patent No. 5,955,347, U.S. Patent No. 5,962,294, U.S. Patent No. 5,135,854, U.S. Patent
No. 4,935,349, U.S. Patent No. 5,707,828, and U.S. Patent No. 5,047,335.

[0085] Appropriate yeast expression systems can be obtained from sources such as the American Type Culture
Collection, Rockville, MD. Vectors are commercially available from a variety of sources.

SEQUENCE LISTING
<110> Gerngross, Tillman U.

<120> METHODS FOR PRODUCING MODIFIED GLYCOPROTEINS 
<130> GH 100 
<140> 09/892,591 
<141> 2001-06-27 

<150> 60/214,358 
<151> 2000-06-28 

<150> 60/215,638 
<151> 2000-06-30 

<150> 60/279,997 
<151> 2001-03-30 

<160> 6 

<170> PatentIn version 3.1

<210> 1 
<211> 21 
<212> DNA 

<213> Artificial Sequence

<220>
<223> Primer A for target gene in P. pastoris (1,6-mannosyltransferase)
<400> 1 

atggcgaagg cagatggcag t 21 

<210> 2 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer B for target gene in P. pastoris (1,6-mannosyltransferase) 
<400> 2 

ttagtccttc caacttcctt c 21 

<210> 3 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer A for target gene in P. pastoris (1,2 mannosyltransferases).

<220>

<221> misc_feature
<222> (3)..(3)

<223> where "n" is equal to "c" or "t"
<220>

<221> misc_feature
<222> (7)..(7)

<223> wherein "n" is equal to "a" or "c"
<220>

<221> misc_feature
<222> (9)..(9)

<223> wherein "n" is equal to "a" or "t" or "g" or "c"
<220>

<221> misc_feature
<222> (12)..(12)

<223> wherein "n" is equal to "a" or "t" or "g" or "c"
<220>

<221> misc_feature
<222> (15)..(15)

<223> wherein "n" is equal to "a" or "g".
<220>

<221> misc_feature
<222> (17)..(17)

<223> wherein "n" is equal to "c" or "t".
<220>

<221> misc_feature
<222> (18)..(18)

<223> wherein "n" is equal to "a" or "t" or "c" or "g"
<220>

<221> misc_feature
<222> (21)..(21)

<223> where "n" is equal to "c" or "t".

<220>

<221> misc_feature
<222> (24)..(24)

<223> wherein "n" is equal to "a" or "c" or "t".
<400> 3

tantggngng tngancnnga natnaa 26

<210> 4
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer for target gene in P. pastoris (1,2 mannosyltransferases). 
<220> 

<221> misc_feature 
<222> (3)..(3) 

<223> wherein "n" is equal to "a" or "g". 
<220> 

<221> misc_feature
<222> (6)..(6)

<223> wherein "n" is equal to "a" or "t" or "g" or "c".
<220>

<221> misc_feature
<222> (12)..(12)

<223> wherein "n" is equal to "a" or "t" or "g" or "c".

<220>

<221> misc_feature
<222> (14)..(14)

<223> wherein "n" is equal to "g" or "c".
<220> 

<221> misc_feature 
<222> (15)..(15) 

<223> wherein "n" is equal to "c" or "t". 
<220> 

<221> misc_feature 
<222> (18)..(18) 

<223> wherein "n" is equal to "a" or "g". 

<400> 4 

gcntcncccc ancnntcnta 

<210> 5 
<211> 4 

<212> PRT 

<213> Artificial Sequence 
<220> 
<223> Signal tetrapeptide
<400> 5 

His Asp Glu Leu
1

<210> 6
<211> 4
<212> PRT
<213> Artificial Sequence
<220>
<223> Signal tetrapeptide
<400> 6

Lys Asp Glu Leu
1
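
The degenerate primers of SEQ ID NO: 3 and SEQ ID NO: 4 list each variable position as "n" with a separate feature note. As an illustration only, the following Python sketch rewrites those positions with standard IUPAC ambiguity codes and counts how many distinct oligonucleotides each primer represents. The function name `annotate` and the feature dictionaries are hypothetical helpers, not part of the sequence listing, and the fourth base at position 18 of SEQ ID NO: 3 is assumed to be "g" because the listing is illegible there.

```python
from math import prod

# IUPAC ambiguity codes keyed by the sorted set of allowed bases.
IUPAC = {
    "ag": "r", "ct": "y", "ac": "m", "gt": "k", "cg": "s", "at": "w",
    "act": "h", "cgt": "b", "acg": "v", "agt": "d", "acgt": "n",
}


def annotate(seq, features):
    """Return (IUPAC-coded primer, degeneracy) for a primer with 'n' positions.

    seq      -- primer as listed; spaces allowed; 'n' at each variable position
    features -- {1-based position: string of bases allowed at that position}
    """
    bases = list(seq.replace(" ", ""))
    for pos, allowed in features.items():
        if bases[pos - 1] != "n":
            raise ValueError(f"position {pos} is not 'n'")
        bases[pos - 1] = IUPAC["".join(sorted(set(allowed)))]
    # Degeneracy is the product of the number of choices at each position.
    degeneracy = prod(len(set(allowed)) for allowed in features.values())
    return "".join(bases), degeneracy


SEQ3 = "tantggngng tngancnnga natnaa"
SEQ3_FEATURES = {
    3: "ct", 7: "ac", 9: "atgc", 12: "atgc", 15: "ag", 17: "ct",
    18: "atcg",  # assumption: the fourth base is "g" (illegible in the listing)
    21: "ct", 24: "act",
}

SEQ4 = "gcntcncccc ancnntcnta"
SEQ4_FEATURES = {3: "ag", 6: "atgc", 12: "atgc", 14: "gc", 15: "ct", 18: "ag"}

primer3, n3 = annotate(SEQ3, SEQ3_FEATURES)
primer4, n4 = annotate(SEQ4, SEQ4_FEATURES)
print(primer3, n3)  # taytggmgngtngarcyngayathaa 6144
print(primer4, n4)  # gcrtcnccccancsytcrta 256
```

Under these assumptions SEQ ID NO: 3 stands for 6144 distinct sequences and SEQ ID NO: 4 for 256.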

Claims 






1. A lower eukaryotic host cell that does not display alpha-1,6 mannosyltransferase activity with respect to the
N-glycan on a glycoprotein, having in its endoplasmic reticulum (ER) or Golgi apparatus

(i) a hybrid enzyme comprising: 



(a) an exogenous mannosidase catalytic domain having optimal activity in said ER or Golgi; fused to

(b) a cellular targeting signal peptide not normally associated with the catalytic domain of (a), wherein
said cellular targeting signal peptide targets said exogenous mannosidase catalytic domain to said ER or
Golgi apparatus; and



(ii) a hybrid enzyme comprising:

(c) an exogenous GnT I catalytic domain having optimal activity in said ER or Golgi; fused to

(d) a cellular targeting signal peptide not normally associated with the catalytic domain of (c), wherein said
cellular targeting signal peptide targets said exogenous GnT I catalytic domain to said ER or Golgi apparatus.



2. The host cell of claim 1, wherein said catalytic domain has α-1,2-mannosidase activity.

3. The host cell of claim 1 or 2, wherein said mannosidase comprises an α-1,2-mannosidase catalytic domain derived
from mouse, human, Lepidoptera, Aspergillus nidulans, Xanthomonas manihotis, or Bacillus sp.



EP 1 522 590 A1 

4. The host cell of any one of claims 1 to 3, wherein the mannosidase has optimal activity at a pH between 5.9 and 7.5.

5. The host cell of any one of claims 1 to 4, further expressing one or more exogenous enzymes selected from the 
group consisting of mannosidases, glycosyltransferases and glycosidases. 

6. The host cell of claim 5, wherein at least one additional exogenous enzyme for the production of a human-like
glycoprotein is localized by forming a fusion protein between a catalytic domain of the exogenous enzyme and a
cellular targeting signal peptide not normally associated with the catalytic domain that targets it to the endoplasmic
reticulum, the early, medial or late Golgi, or the trans Golgi network of the host cell.

7. The host cell of claim 6, wherein said catalytic domain is from a glycosidase or glycosyltransferase selected from
GnT I, GnT II, GnT III, GnT IV, GnT V, GnT VI, GalT, fucosyltransferase, sialyltransferase.

8. The host cell of any one of claims 5 to 7, further expressing one or more exogenous enzymes selected from the
group consisting of UDP-GlcNAc transferase, UDP-galactosyltransferase, GDP-fucosyltransferase,
CMP-sialyltransferase, UDP-GlcNAc transporter, UDP-galactose transporter, GDP-fucose transporter, CMP-sialic acid
transporter, and nucleotide diphosphatases.

9. The host cell of any one of claims 5 to 8, further expressing one or more exogenous enzymes selected from: a
UDP-specific diphosphatase and a GDP-specific diphosphatase.

10. The host cell of any one of claims 1 to 9, obtained by introducing nucleic acid molecules encoding said enzymes. 

11. The host cell of claim 10, wherein at least one of the nucleic acid molecules encoding said enzymes is integrated
into the host cell chromosome.

12. The host cell of any one of claims 1 to 11, which is selected from the group consisting of Pichia pastoris, Pichia
finlandica, Pichia trehalophila, Pichia koclamae, Pichia membranaefaciens, Pichia opuntiae, Pichia thermotolerans,
Pichia salictaria, Pichia guercuum, Pichia pijperi, Pichia stiptis, Pichia methanolica, Pichia sp., Saccharomyces
cerevisiae, Saccharomyces sp., Hansenula polymorpha, Kluyveromyces sp., Candida albicans, Aspergillus nidulans,
and Trichoderma reesei.

13. The host cell of any one of claims 1 to 12, which is additionally deficient in the activity of one or more enzymes
selected from the group consisting of mannosyltransferases and phosphomannosyltransferases.

14. The host cell of claim 13, which does additionally not express an enzyme selected from the group consisting of
1,6 mannosyltransferase; 1,3 mannosyltransferase; and 1,2 mannosyltransferase with respect to the N-glycan.

15. The host cell of any one of claims 1 to 14, wherein the host is an OCH1 mutant of P. pastoris.

16. A method for producing the lower eukaryotic host cell of any one of claims 1 to 15, the method comprising the step
of transforming a lower eukaryotic cell with

(i) a nucleic acid molecule encoding a hybrid enzyme comprising:

(a) an exogenous mannosidase catalytic domain having optimal activity in said ER or Golgi; fused to

(b) a cellular targeting signal peptide not normally associated with the catalytic domain of (a), wherein
said cellular targeting signal peptide targets said exogenous catalytic domain to said ER or Golgi apparatus; and

(ii) a nucleic acid molecule encoding a hybrid enzyme comprising:

(c) an exogenous GnT I catalytic domain having optimal activity in said ER or Golgi; fused to

(d) a cellular targeting signal peptide not normally associated with the catalytic domain of (c), wherein said
cellular targeting signal peptide targets said exogenous GnT I catalytic domain to said ER or Golgi apparatus.

17. A method for producing a human-like glycoprotein comprising expressing the glycoprotein in a host cell of any one
of claims 1 to 15.

18. The method of claim 17, wherein said glycoprotein comprises N-glycans having fewer than six mannose residues.

19. The method of claim 17 or 18, wherein said glycoprotein comprises one or more sugars selected from the group
consisting of galactose, N-acetylglucosamine, sialic acid and fucose.

20. The method of any one of claims 17 to 19, wherein said glycoprotein comprises at least one oligosaccharide branch
comprising the structure NeuNAc-Gal-GlcNAc-Man. 

21. The method of any one of claims 17 to 20, wherein said glycoprotein is selected from the group consisting of
erythropoietin, interferon-α, interferon-β, interferon-γ, interferon-ω, granulocyte-CSF, factor VIII, factor IX, human
protein C, soluble IgE receptor α-chain, IgG, IgM, urokinase, chymase, urea trypsin inhibitor, IGF-binding protein,
epidermal growth factor, growth hormone-releasing factor, annexin V fusion protein, angiostatin, vascular endothelial
growth factor-2, myeloid progenitor inhibitory factor-1 and osteoprotegerin.

22. The method of any one of claims 17 to 21, further comprising the step of isolating the human-like glycoprotein from
the host.

23. The method of claim 22, further comprising the step of subjecting the isolated glycoprotein to at least one further
glycosylation reaction in vitro, subsequent to its isolation from the host.

24. A method for producing a human-like glycoprotein in a lower eukaryotic host cell that does not display alpha-1,6
mannosyltransferase activity with respect to the N-glycan on a glycoprotein, the method comprising the step of:

(a) expressing said human-like glycoprotein in a population of said host cells transformed with a DNA library
comprising at least two different genetic constructs, at least one of which comprises a DNA fragment encoding
a hybrid enzyme comprising:

(i) a glycosidase or glycosyltransferase enzyme catalytic domain having optimal activity in said ER or
Golgi; fused to

(ii) a cellular targeting signal peptide not normally associated with the catalytic domain of (i), wherein said
cellular targeting signal peptide targets the ER or Golgi apparatus of the host cell; and

(b) selecting the host cell exhibiting a human-like glycoprotein with a desired human-like glycosylation pattern
on said glycoprotein.

25. The method of claim 24, wherein the library comprises at least one gene or genetic construct previously subjected 
to a technique selected from: gene shuffling, in vitro mutagenesis, and error-prone polymerase chain reaction. 

26. The method of claim 24 or 25, wherein the DNA fragment encoding the glycosylation enzyme catalytic domain has
an activity selected from the group consisting of mannosidase, UDP-GlcNAc transferase, UDP-galactosyltransferase,
or glycosyltransferase selected from GnT I, GnT II, GnT III, GnT IV, GnT V, GnT VI, GalT, fucosyltransferase,
sialyltransferase; and wherein the cellular targeting signal peptide localizes the enzyme predominantly in a host
cell organelle selected from the group consisting of endoplasmic reticulum, cis Golgi, medial Golgi, and trans Golgi.

27. The method of any one of claims 24 to 26, wherein the library comprises at least one DNA fragment encoding a 
wild-type glycosylation enzyme. 

28. The method of any one of claims 24 to 27, wherein the selection comprises the step of analyzing a glycosylated
protein or isolated N-glycan by one or more methods selected from the group consisting of: (a) mass spectroscopy
such as MALDI-TOF-MS; (b) liquid chromatography; (c) characterizing cells using a fluorescence activated cell
sorter, spectrophotometer, fluorimeter, or scintillation counter; (d) exposing host cells to a lectin or antibody having
a specific affinity for a desired oligosaccharide moiety; and (e) exposing cells to a cytotoxic or radioactive molecule
selected from the group consisting of sugars, antibodies and lectins.

29. A transformed host cell produced in the method of any one of claims 16 or 24 to 28.

30. A human-like glycoprotein obtainable by any one of the methods of claims 17 to 28, which has been produced
through glycosylation modification steps comprising at least one glycosylation modification step of producing at
least 30 mole % of an N-glycan that has acted as a substrate for GnT I in vivo, said human-like glycoprotein lacking
one or more sugars selected from the group consisting of galactose, fucose and terminal sialic acid.

31. A DNA library comprising at least two different genetic constructs, wherein at least one genetic construct comprises
a DNA fragment encoding a cellular targeting signal peptide which targets to the endoplasmic reticulum or Golgi
apparatus ligated in-frame with a DNA fragment encoding a glycosidase or glycosyltransferase enzyme or
catalytically active fragment thereof.

32. A plurality of host cells comprising a DNA library of claim 31.
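
The library of claims 24 to 32 pairs targeting signal peptides with catalytic domains. As a rough illustration only, the Python sketch below enumerates such pairings as a simple Cartesian product. The organelle and enzyme names are taken from claim 26; the function name `build_library` and the dictionary layout are hypothetical, and no actual sequences or cloning details are represented.

```python
from itertools import product

# Targeting destinations and catalytic activities named in claim 26.
# These strings are labels only; they stand in for real DNA fragments.
TARGETS = ["endoplasmic reticulum", "cis Golgi", "medial Golgi", "trans Golgi"]
CATALYTIC_DOMAINS = ["mannosidase", "GnT I", "GnT II", "GalT"]


def build_library(targets, domains):
    """Pair every targeting signal peptide with every catalytic domain."""
    return [
        {"targeting_signal": target, "catalytic_domain": domain}
        for target, domain in product(targets, domains)
    ]


library = build_library(TARGETS, CATALYTIC_DOMAINS)
print(len(library))  # 16 constructs: 4 targets x 4 domains
print(library[0])
```

Each entry corresponds to one candidate fusion construct. In the claimed method, host cells transformed with such a library would then be screened for the desired glycosylation pattern.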
[Figure 1A: N-glycan processing diagram. The Man-GlcNAc-GlcNAc-Asn core glycan leaving the ER is extended in the Golgi by 1,6 mannosyltransferase(s), 1,2 mannosyltransferase(s), further mannosyltransferase(s) and mannosylphosphate transferases.]

[Figure residue, label not recoverable: N-glycan processing diagram with steps labeled α1,2 mannosidase; Mannosidase IA, IB, IC; GnT I; Mannosidase II; GnT II; β1,4 GnT IV; β1,4 GnT VI; β1,4 GalT.]